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BACTERICIDAL/PERMEABILITY-INCREASING PROTEIN: 
CRYSTALLIZATION, X-RAY DIFFRACTION, THREE- 
DIMENSIONAL STRUCTURE DETERMINATION, 
RATIONAL DRUG DESIGN AND MOLECULAR MODELING OF 

RELATED PROTEINS 

5 This is a continuation-in-part application of U.S. Application Serial No. 

08/879,565, filed June 20, 1997, which is hereby incorporated by reference in its 
entirety. 

The present invention generally pertains to the fields of protein crystallization, 
x-ray diffraction analysis, three-dimensional structural determination, rational drug 

10 design and molecular modeling of related proteins. The present invention solves the 
three-dimensional structure of bactericidal/permeability-increasing protein (BPI) and 
provides crystallization methods for BPI protein products. A crystallized BPI protein 
product was physically analyzed by x-ray diffraction techniques. The resulting x-ray 
diffraction patterns were of sufficiently high resolution to be useful for determining 

15 the three-dimensional structure of BPI and have yielded atomic coordinates for BPI. 
The present invention relates to uses of BPI coordinates for molecular modeling of 
related proteins and rational drug design (RDD) of mimetics and ligands for BPI and 
for related proteins. The present invention also relates to atomic coordinates of BPI, 
or portions thereof, to solve crystal forms of BPI proteins or their fragments, analogs, 

20 and variants thereof, or of related proteins, including lipid transfer proteins, or their 
fragments, analogs and variants. 

Part of the work performed during development of this invention utilized U.S. 
Government funds. The U.S. Government has certain rights in this invention. 

Background of the Invention 

25 Bactericidal/permeability-increasing protein (BPI) is a protein isolated from 

the granules of mammalian polymorphonuclear leukocytes (PMNs or neutrophils), 
which are blood cells essential in the defense against invading microorganisms. BPI 
is known to bind the lipopoly saccharide (LPS), a major component of the outer 
membrane of gram-negative bacteria that stimulates a potent inflammatory response. 

30 Human BPI protein has been isolated from PMNs by acid extraction combined with 
either ion exchange chromatography [Elsbach, J. BioL Chem., 254:11000 (1979)] or 
£. coli affinity chromatography [Weiss, et al., Blood, 69:652 (1987)]. BPI obtained 
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in such a manner is referred to herein as natural BPI and has been shown to have 
potent bactericidal activity against a broad spectrum of gram-negative bacteria. The 
molecular weight of human BPI is approximately 55,000 daltons (55 kD). The amino 
acid sequence of the entire human BPI protein and the nucleic acid sequence of DNA 
5 encoding the protein have been reported in Figure 1 of Gray et al., J. Biol. Chem., 
264:9505 (1989), incorporated herein by reference. The Gray et al. amino acid 
sequence is set out in SEQ ID NO: 1 hereto. U.S. Patent No. 5,198,541, EP0375724 
and WO89/10486 (PCT/US88/02700) disclose recombinant genes encoding and 
methods for expression of BPI proteins, including BPI holoprotein and fragments of 
10 BPI. 

A proteolytic N-terminal fragment of BPI having a molecular weight of about 
25 kD possesses essentially all the anti-bacterial efficacy of the naturally-derived 55 
kD human BPI holoprotein. [Ooi et al., J. Bio. Chem., 262: 14891-14894 (1987)]. 
In contrast to the N-terminal portion, the C-terminal region of the isolated human BPI 

15 protein displays only slightly detectable anti-bacterial activity against gram-negative 
organisms. [Ooi et al., J. Exp. Med., 174:649 (1991).] An N-terminal BPI fragment 
of approximately 23 kD, referred to as "rBPI^," has been produced by recombinant 
means and also retains anti-bacterial activity against gram-negative organisms. 
[Gazzano-Santoroetal., /n/ecf. Immun. 60:4754-4761(1992).] An N-terminal analog 

20 of BPI, rBPI 21 , has been produced as described in Horwitz et al., Protein Expression 
Purification, 5:28-40 (1996). 

The bactericidal effect of BPI has been reported to be highly specific to gram- 
negative species, e.g., in Elsbach and Weiss, Inflammation: Basic Principles and 
Clinical Correlates , eds. Gallin et al., Chapter 30, Raven Press, Ltd. (1992). This 

25 reported target cell specificity was believed to be the result of the strong attraction 
of BPI for LPS on the outer membrane (or envelope) of gram-negative organisms. 
Although BPI was commonly thought to be non-toxic for other microorganisms, 
including yeast and for higher eukaryotic cells, it has recently been discovered that 
BPI protein products exhibit activity against gram-positive bacteria, mycoplasma, 

30 mycobacteria, fungi, protozoa and chlamydia. 

The precise mechanism by which BPI kills gram-negative bacteria is not yet 
completely elucidated, but it is believed that BPI must first bind to the surface of the 
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bacteria through electrostatic and hydrophobic interactions between the cationic BPI 
protein and negatively charged sites on lipopoly saccharide. Bacterial LPS has been 
referred to as "endotoxin" because of the potent inflammatory response that it 
stimulates, i.e., the release of mediators by host inflammatory cells which may 

5 ultimately result in irreversible endotoxic shock. BPI binds to lipid A, reported to 
be the most toxic and most biologically active component of LPS. 

In susceptible gram-negative bacteria, BPI binding is thought to disrupt LPS 
structure, leading to activation of bacterial enzymes that degrade phospholipids and 
peptidoglycans, altering the permeability of the cell's outer membrane, and initiating 

10 events that ultimately lead to cell death. [Elsbach and Weiss (1992), supra], BPI has 
been proposed to act in two stages. The first stage proposed is a sublethal one that 
is characterized by immediate growth arrest, penneabilization of the outer membrane 
and selective activation of bacterial enzymes that hydrolyze phospholipids and 
peptidoglycans. Bacteria at this stage could be rescued by growth in serum albumin 

15 supplemented media [Mannion et ah, J. Clin. Invest., 55:853-860 (1990)]. The 
second stage, defined by growth inhibition that cannot be reversed by serum albumin, 
is proposed to occur after prolonged exposure of the bacteria to BPI and characterized 
by extensive physiologic and structural changes, including apparent damage to the 
inner cytoplasmic membrane. 

20 Initial binding of BPI to LPS leads to organizational changes that probably 

result from binding to the anionic groups of LPS, which normally stabilize the outer 
membrane through binding of Mg ++ and Ca ++ . Attachment of BPI to the outer 
membrane of gram-negative bacteria produces rapid penneabilization of the outer 
membrane to hydrophobic agents such as actinomycin D. Binding of BPI and 

25 subsequent gram-negative bacterial killing depends, at least in part, upon the LPS 
polysaccharide chain length, with long O-chain bearing, "smooth" organisms being 
more resistant to BPI bactericidal effects than short O-chain bearing, "rough" 
organisms [Weiss et al., J. Clin. Invest. 65: 619-628 (1980)]. This penneabilization 
of the gram-negative outer envelope is reversible upon dissociation of the BPI, a 

30 process requiring high concentrations of divalent cations and synthesis of new LPS 
[Weiss et al., J. Immunol. 132: 3109-3115 (1984)]. Loss of gram-negative bacterial 
viability, however, is not reversed by processes which restore the envelope integrity, 
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suggesting that the bactericidal action is mediated by additional lesions induced in the 
target organism and which may be situated at the cytoplasmic membrane (Mannion 
et al., J. Gin. Invest. 86: 631-641 (1990)). Specific investigation of this possibility 
has shown that on a molar basis BPI is at least as inhibitory of cytoplasmic membrane 
5 vesicle function as polymyxin B but the exact mechanism as well as the relevance of 
such vesicles to studies of intact organisms was not elucidated (In't Veld, et al., 
Infection and Immunity 56: 1203-1208 (1988)). 

BPI is a member of a gene/protein family of lipopolysaccharide binding and 
lipid transfer proteins whose other currently known members include 

10 lipopolysaccharide binding protein (LBP), cholesteryl ester transfer protein (CETP) 
and phospholipid transfer protein (PLTP). 

BPI protein products (which include naturally and recombinantly produced BPI 
protein; natural, synthetic, and recombinant biologically active polypeptide fragments 
of BPI protein; biologically active polypeptide variants of BPI protein or fragments 

15 thereof, including hybrid fusion proteins and dimers; biologically active polypeptide 
analogs of BPI protein or fragments or variants thereof, including cysteine-substituted 
analogs; and BPI-derived peptides) have been demonstrated to have a variety of 
beneficial activities. BPI protein products are known to be bactericidal for gram- 
negative bacteria, as described in U.S. Patent Nos. 5,198,541 and 5,523,288, both 

20 of which are incorporated herein by reference. BPI protein products are also known 
to enhance the effectiveness of antibiotic therapy in gram-negative bacterial infections, 
as described in U.S. Patent No* 5,523,288 and corresponding International 
Publication No. WO 95/08344 (PCT/US94/ 11225), which are incorporated herein by 
reference. BPI protein products are also known to be bactericidal for gram-positive 

25 bacteria and mycoplasma, and to enhance the effectiveness of antibiotics in gram- 
positive bacterial infections, as described in U.S. Patent No. 5,578,572 and 
corresponding International Publication No. WO 95/19180 (PCT/US95/00656), which 
are incorporated herein by reference. BPI protein products are further known to 
exhibit anti-fungal activity, and to enhance the activity of other anti-fungal agents, as 

30 described in U.S. Patent No. 5,627,153 and corresponding International Publication 
No. WO 95/19179 (PCT/US95/00498), and further as described for anti-fungal 
peptides in co-pending U.S. Application Serial No. 08/621,259 filed March 21, 1996, 
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which is in turn a continuation-in-part of U.S. Application Serial No. 08/504,841 
filed July 20, 1994 and corresponding International Publication Nos. WO 96/08509 
(PCT/US95/09262) and WO 97/04008 (PCT/US96/03845), all of which are 
incorporated herein by reference. BPI protein products are further known to exhibit 

5 anti-protozoan activity, as described in U.S. Patent No. 5,646,1 14 and corresponding 
International Publication No. WO 96/01647 (PCT/US95/08624), all of which are 
incorporated herein by reference. BPI protein products are known to exhibit anti- 
chlamydial activity, as described in co-pending U.S. Application Serial No. 
08/694,843 filed August 9, 1996 and corresponding International Publication No. WO 

10 98/06415 (PCT/US97/13810), all of which are incorporated herein by reference. 
Finally, BPI protein products are known to exhibit anti-mycobacterial activity, as 
described in co-pending U.S. Application Serial No. 08/626,646 filed April 1, 1996, 
which is in turn a continuation of U.S. Application Serial No. 08/285,803 filed 
August 14, 1994, which is in turn a continuation-in-part of U.S. Application Serial 

15 No. 08/031,145 filed March 12, 1993 and corresponding International Publication 
No. WO94/20129 (PCT/US94/02463), all of which are incorporated herein by 
reference. 

The effects of BPI protein products in humans with endotoxin in circulation, 
including effects on TNF, IL-6 and endotoxin are described in U.S. Patent Nos. 

20 5,643,875 and 5,753 f 620 and corresponding International Publication No. WO 
95/19784 (PCT/US95/01151), all of which are incorporated herein by reference, 

BPI protein products are also known to be useful for treatment of specific 
disease conditions, such as meningococcemia in humans (as described in co-pending 
U.S. Application Serial No. 08/644,287 filed May 10, 1996 and corresponding 

25 International Publication No. WO 97/42966 (PCT/US97/08016), which are 
incorporated herein by reference), hemorrhagic trauma in humans, (as described in 
co-pending U.S. Application Serial No. 08/862,785, a continuation-in-part of U.S. 
Serial No. 08/652,292 filed May 23, 1996, now U.S. Patent No. 5,756,464, and 
corresponding International Publication No. WO 97/44056 (PCT/US97/08941), all 

30 of which are incorporated herein by reference), burn injury (as described in U.S. 
Patent No. 5,494,896 and corresponding International Publication No. WO 96/30037 
(PCT/US96/02349), both of which are incorporated herein by reference), 
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ischemia/reperfusion injury (as described in U.S. Patent No. 5,578,568, incorporated 
herein by reference), and liver resection (as described in co-pending U.S. Application 
Serial No. 08/582,230 filed March 16, 1998 which is a continued prosecution 
application of the same serial no. filed January 3, 1996, which is in turn a 
5 continuation of U.S. Application Serial No. 08/318,357 filed October 5, 1994, which 
is in turn a continuation-in-part of U.S. Application Serial No. 08/132,510 filed 
October 5, 1993, and corresponding International Publication No. WO 95/10297 
(PCT/US94/ 11404), all of which are incorporated herein by reference). 

BPI protein products are also known to neutralize the anti-coagulant activity 

10 of exogenous heparin, as described in U.S. Patent No. 5,348,942, incorporated herein 
by reference, as well as to be useful for treating chronic inflammatory diseases such 
as rheumatoid and reactive arthritis, as described in U.S. Patent No. 5,639,727, 
incorporated herein by reference, and for inhibiting angiogenesis and for treating 
angiogenesis-associated disorders including malignant tumors, ocular retinopathy and 

15 endometriosis, as described in co-pending U.S. Application Serial Nos. 08/435,855, 
08/466,624 and 08/466,826, and corresponding International Publication No. WO 
94/20128 (PCT/US94/02401), all of which are incorporated herein by reference. 

BPI protein products are also known for use in antithrombotic methods, as 
described in U.S. Patent No. 5,741,779 and corresponding International Publication 

20 No. W097/42967 (PCT/US97/08017), which are incorporated herein by reference. 

U.S. Patent Nos. 5,420,019 and 5,674,834 and corresponding International 
Publication No. W094/18323 (PCT/US94/01235), all of which are incorporated 
herein by reference, disclose that the replacement of the cysteine residue at amino 
acid position 132 or 135 with another amino acid renders the resulting BPI 

25 polypeptide resistant to dimerization and cysteine adduct formation. It also discloses 
that terminating the N-terminal BPI fragment at BPI amino acid position 193 resulted 
in an expression product with reduced carboxy-terminal heterogeneity. 

Because of the multiplicity of valuable activities and uses of BPI protein 
products as exemplified above, a need continues to exist for new products with 

30 structures based on or mimicking a BPI protein product and having one or more of 
the activities and/or uses of BPI protein products, including use as anti-infective 
products, including antimicrobial agents (e.g., gram-negative bacteria [U.S. Patent 
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Nos. 5, 198,541 and 5,523,288; WO95/08344 (PCT/US94/1 1225)] and gram-positive 
bacteria [U.S. Patent No. 5,578,572; WO95/19180 (PCT/US95/00656)], fungi [U.S. 
PatentNo. 5,627,153; WO95/19179(PCT/US95/00498)], mycobacteria [EP0690721; 
WO94/20129 (PCT/US94/02463)] and chlamydia [WO96/01647 (PCT/US95/08624)] 

5 and endotoxin binding/neutralizing agents [WO95/019784 (PCT/US95/01151)], and 
as heparin binding/neutralizing products [U.S. Patent Nos. 5,348,942 and 5,639,727; 
WO94/20128 (PCT/US94/02401)], including for the neutralization of exogenously 
administered heparin, inhibition of angiogenesis (normal or pathological) for the 
treatment of chronic inflammatory disease states, and anticoagulant and thrombolytic 

10 agents for the treatment of thrombotic disorders [PCT/US97/08017]. All of the 
above-listed references regarding biological or functional activities of BPI, as well as 
therapeutic and diagnostic uses of BPI, are hereby incorporated by reference. One 
avenue of investigation toward solving the problem of new products based on BPI and 
fulfilling this need is the determination of the crystal structure of a BPI protein 

15 product. 

Summary of the Invention 
The present invention solves the above problem and fulfills the need for 
designating and making new and useful products based on BPI. It is an object of this 
invention to solve the three-dimensional structure of BPI and thereby provide the 
20 atomic coordinates (i.e., structure coordinates) of BPI from the analysis of x-ray 
diffraction patterns of sufficiently high resolution to be useful for determining the 
three-dimensional protein structure. 

It is an object of this invention to provide methods of expressing, purifying 
and crystallizing bactericidal/permeability-increasing protein (BPI) products, and 
25 thereby provide crystallized BPI proteins. 

It is an object of this invention to provide the use of the structure coordinates 
of a BPI crystal to allow the design of compounds for mimicking a BPI protein 
product to reveal the atomic details of ligand binding sites of BPI (e.g. lipid-like or 
heparin-like molecules). 
30 It is an object of this invention to provide use of the structure coordinates of 

a BPI crystal as described herein to solve the crystal structure of a crystal of a 
different BPI protein or fragment, analog or variant thereof, or a crystal of a related 
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protein, including a BPI-related lipid transfer protein or a fragment, analog or variant 
thereof. 

It is an object of this invention to provide mutants of BPI or fragments, 
analogs, or variants thereof characterized by one or more different properties as 
5 compared with wild-type BPI, These properties include altered surface charge, 
altered lipid binding pockets, altered specificity or higher activity. BPI mutants are 
useful to identify those amino acids that are most important for the lipid and heparin 
binding activity and other biological activities of BPI. This information, in turn, 
allows the design of new structures with one or more different properties based on 
10 BPI. 

It is an object of this invention to provide the use of the structure coordinates 
and atomic details of BPI as described herein or its fragments, analogs or variants 
(including mutants or co-complexes) or of a BPI-related lipid transfer protein or its 
fragments, analogs or variants (including mutants or co-complexes) to design, 

15 evaluate computationally, synthesize and use new structures based on BPI with 
desirable properties of BPI, such as physical and pharmacological properties. 

X-ray diffraction patterns of a related protein can be analyzed directly to 
provide the three-dimensional structure (if of sufficiently high resolution), however, 
the atomic coordinates for the crystallized BPI, as provided herein, can be used for 

20 structure determination. The x-ray diffraction patterns obtained by methods of the 
present invention, and provided on computer readable media, are used to provide 
electron density maps. The amino acid sequence is also useful for three-dimensional 
structure determination. The data is then used in combination with phase 
determination (e.g. using multiple isomorphous replacement (MIR) molecular 

25 replacement techniques) to generate electron density maps of BPI, using a suitable 
computer system. The electron density maps, provided by analysis of either the x-ray 
diffraction patterns or working backwards from the atomic coordinates, provided 
herein, are then fitted using suitable computer algorithms to generate secondary, 
tertiary and/or quaternary structures and/or domains of BPI, which structures and/or 

30 domains are then used to provide an overall three-dimensional structure, as well as 
binding and/or active sites of BPI. 
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It is also an object of this invention to specifically provide for the use of three- 
dimensional modeling of BPI and other members of the BPI protein family using the 
coordinates from the x-ray diffraction patterns. The coordinates and amino acid 
sequences are entered into one or more computer programs for molecular modeling. 

5 Such molecular modeling programs generate atomic coordinates that reflect the 
secondary, tertiary and/or quaternary structures of the protein which contribute to its 
overall three-dimensional structure and provide information related to binding and/or 
active sites of the protein. 

It is a further object of this invention to specifically provide for the use of 

10 similar molecular modeling for rational drug design (RDD) of mimetics and ligands 
of BPI and other members of the BPI protein family. The drug design paradigm uses 
computer modeling programs to determine potential mimetics and ligands which are 
expected to interact with sites on the protein. The potential mimetics or ligands are 
then screened for activity and/or binding. For BPI-related mimetics or ligands, 

15 screening methods can be selected from assays for at least one biological activity of 
BPI, e.g., anti-microbial, LPS- binding/neutralizing, heparin binding/neutralizing, 
and/or antithrombotic activities, according to known method steps. Similarly for 
LBP-, CETP- or PLTP-related mimetics or ligands, such screening methods can be 
selected from assays for at least one biological activity of LBP, CETP or PLTP, 

20 according to known method steps. The resulting mimetics or ligands are then 
provided by methods of the present invention and are useful for treating, inhibiting 
or preventing BPI-modulated diseases (or LBP-, CETP- and PLTP-modulated) in 
animals, including humans. 

Thus, as described herein, the present invention provides use of atomic 

25 coordinates of a BPI protein, or fragment, analog or variant thereof, to model a BPI 
protein or a related protein, including a BPI-related lipid transfer protein, such as 
LBP, CETP or PLTP, or fragment, analog or variant thereof. 

The present invention also provides use of atomic coordinates of a BPI protein 
wherein the BPI protein comprises a binding site characterized by amino acid residues 

30 of at least one binding pocket as defined in Table 3, and/or wherein the BPI protein 
comprises a binding site characterized by at least one amino acid sequence, or variant 
of the sequence, selected from positions about 17 to about 45, positions about 65 to 
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about 99 or positions about 142 to about 169 of BPI, or alternatively, a binding site 
characterized by at least one amino acid sequence, or variant of the sequence, selected 
from positions about 36 to about 54, positions about 84 to about 109 or positions 
about 142 to about 164 of BPL 
5 The present invention provides use of atomic coordinates of a BPI protein to 

computationally design a chemical compound for mimicking a BPI protein, or 
fragment, analog or variant thereof, or a BPI-related lipid transfer protein, or 
fragment, analog or variant thereof, including, for example, lipopolysaccharide- 
binding protein (LBP), cholesteryl ester transferase protein (CETP) or phospholipid 

10 transfer protein (PLTP), or fragment, analog, or variant thereof. 

The present invention also provides use of atomic coordinates of BPI protein 
to design a chemical compound capable of associating with a BPI-related lipid binding 
protein, or fragment, analog or variant thereof, including, for example, 
bactericidal/permeability-increasing protein (BPI), lipopolysaccharide-binding protein 

15 (LBP), cholesteryl ester transferase protein (CETP) or phospholipid transfer protein 
(PLTP), or fragment, analog or variant thereof. 

The present invention provides use of atomic coordinates of a BPI protein to 
design a model of ligands in an active site of a lipid binding protein, including, for 
example, BPI protein, lipopolysaccharide-binding protein (LBP), cholesteryl ester 

20 transferase protein (CETP) or phospholipid transfer protein (PLTP), or fragment, 
analog or variant thereof. 

The present invention provides use of atomic coordinates of a 
bactericidal/permeability-increasing ("BPI") protein, to design compounds with at 
least one activity selected from the group consisting of antibacterial, antifungal, 

25 antimycobacterial, antichlamydial, antiprotozoal heparin-binding, endotoxin-binding, 
heparin-neutralizing, endotoxin-neutralizing, inhibition of tumor and endothelial cell 
proliferation, inhibition of angiogenesis, anti-inflammatory, anticoagulant and 
antithrombotic. The coordinates disclosed herein are suitable for all of the 
aforementioned uses of atomic coordinates.. 

30 The present invention provides a method of three-dimensional modeling of a 

BPI protein or BPI-related lipid transfer protein comprising the steps of: (a) providing 
three-dimensional atomic coordinates derived from X-ray diffraction measurements 
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of a BPI protein in a computer readable format; (b) inputting the data from step (a) 
into a computer with appropriate software programs; and (c) generating a three- 
dimensional structural representation of the BPI protein or BPI-related lipid transfer 
protein suitable for visualization and further computational manipulation; particularly 

5 wherein the BPI protein comprises a binding site characterized by amino acid residues 
of at least one binding pocket as defined in Table 3, or wherein the BPI protein 
comprises a binding site characterized by at least one amino acid sequence, or variant 
of the sequence, selected from positions about 17 to about 45, positions about 65 to 
about 99 or positions about 142 to about 169 of BPI or wherein the BPI protein 

10 comprises a binding site characterized by amino acid residues of at least one binding 
pocket as defined in Table 3 and a binding site characterized by at least one amino 
acid sequence, or variant of the sequence, selected from positions about 36 to about 
54, positions about 84 to about 109 or positions about 142 to about 164 of BPI, or 
alternatively from about positions 36 to about 54, from about positions 84 to about 

15 109, or about positions 142 to about 164. 

The present invention provides a method for providing an atomic model of a 
BPI protein, BPI-related lipid binding protein, or fragment, analog or variant thereof, 
comprising (a) providing a computer readable medium having stored thereon atomic 
coordinate/x-ray diffraction data of a BPI protein, or fragment, analog or variant 

20 thereof, in crystalline form, the data sufficient to model the three-dimensional 
structure of the BPI protein, or fragment, analog or variant thereof; (b) analyzing, on 
a computer using at least one subroutine executed in said computer, atomic 
coordinate/x-ray diffraction data from (a) to provide atomic coordinate data output 
defining an atomic model of said BPI protein, BPI-related binding lipid protein or 

25 fragment, analog or variant thereof, said analyzing utilizing at least one computing 
algorithm selected from the group consisting of data processing and reduction, auto- 
indexing, intensity scaling, intensity merging, amplitude conversion, truncation, 
molecular replacement, molecular alignment, molecular refinement, electron density 
map calculation, electron density modification, electron map visualization, model 

30 building, rigid bo f dy refinement, positional refinement; and (c) obtaining atomic 
coordinate data defining the three-dimensional structure of at least , one of said BPI 
protein, BPI-related lipid binding protein, or fragment, analog or variant thereof; 
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particularly wherein said computer readable medium further has stored thereon data 
corresponding to a nucleic acid sequence or an amino acid sequence data comprising 
at least one structural domain or functional domain of a BPI, LBP, CETP or PLTP 
corresponding to at least one BPI or mutant primary sequence or fragment, analog or 
5 variant thereof; and wherein said analyzing step further comprises analyzing said 
sequence data. 

The present invention provides a computer-based system for providing atomic 
model data of the three-dimensional structure of a BPI protein, BPI-related lipid 
binding protein or fragment, analog or variant thereof, comprising the following 

10 elements: (a) at least one computer readable medium (CRM) having stored thereon 
atomic coordinate/x-ray diffraction data of a BPI protein, or fragment, analog or 
variant thereof; (b) at least one computing subroutine that, when executed in a 
computer, causes the computer to analyze atomic coordinate/x-ray diffraction data 
from (a) to provide atomic coordinate data output defining an atomic model of a BPI 

15 protein, BPI-related lipid binding protein or fragment, analog or variant thereof, said 
analyzing utilizing at least one computing subroutine selected from the group 
consisting of data processing and reduction, auto-indexing, intensity scaling, intensity 
merging, amplitude conversion, truncation, molecular replacement, molecular 
alignment, molecular refinement, electron density map calculation, electron density 

20 modification, electron map visualization, model building, rigid body refinement, 
positional refinement; and (c) retrieval means for obtaining atomic coordinate output 
data substantially defining the three-dimensional structure of said BPI protein, BPI- 
related lipid binding protein or fragment, analog or variant thereof. 

The present invention provides a method for providing a computer atomic 

25 model of a ligand of a BPI protein, BPI-related lipid binding protein, or fragment, 
analog or variant thereof, comprising: (a) providing a computer readable medium 
(CRM) having stored thereon atomic coordinate data of a BPI protein, or fragment, 
analog, or variant thereof; (b) providing a CRM having stored thereon atomic 
coordinate data sufficient to generate atomic models of potential ligands of said BPI 

30 protein, BPI-related lipid binding protein, or fragment, analog, or variant thereof; (c) 
analyzing on a computer, using at least one subroutine executed in said computer, the 
atomic coordinate data from (a) and ligand data from (b), to determine binding sites 
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of BPI protein, BPI-related lipid binding protein, or fragment, analog, or variant 
thereof, and to provide atomic coordinate data defining an atomic model of at least 
one ligand of said BPI protein, BPI-related lipid binding protein, or fragment, analog 
or variant thereof, said analyzing utilizing computing subroutines selected from the 

5 group consisting of data processing and reduction, auto-indexing, intensity scaling, 
intensity merging, amplitude conversion, truncation, molecular replacement, 
molecular alignment, molecular refinement, electron density map calculation, electron 
density modification, electron map visualization, model building, rigid body 
refinement, positional refinement; and (d) obtaining atomic coordinate model output 

10 data defining the three-dimensional structure of said at least one ligand of said BPI 
protein, BPI-related lipid binding protein, or fragment, analog, or variant thereof. 

The present invention provides a computer-based system for providing an 
atomic model of at least one ligand of a BPI protein, BPI-related lipid binding 
protein, or fragment, analog or variant thereof, comprising the following elements: 

15 (a) a computer readable medium (CRM) having stored thereon atomic coordinate data 
of a BPI protein, fragment, analog or variant thereof; (b) a CRM having stored 
thereon atomic coordinate data sufficient to generate atomic models of potential 
Iigands of a BPI protein, BPI-related lipid binding protein, or fragment, analog or 
variant thereof; (c) at least one computing subroutine for analyzing on a computer, 

20 the atomic coordinate data from (a) and (b), to determine binding sites of BPI protein, 
BPI-related lipid binding protein, or fragment, analog, or variant thereof, and to 
provide data output defining an atomic model of at least one potential ligand of BPI 
protein, BPI-related lipid binding protein, or fragment, analog, or variant thereof, 
said analyzing utilizing at least one computing subroutine selected from the group 

25 consisting of data processing and reduction, auto-indexing, intensity scaling, intensity 
merging, amplitude conversion, truncation, molecular replacement, molecular 
alignment, molecular refinement, electron density map calculation, electron density 
modification, electron map visualization, model building, rigid body refinement, 
positional refinement; and (d) retrieval means for obtaining atomic coordinate data of 

30 said at least one ligand of a BPI protein, BPI-related lipid binding protein, or 
fragment, analog or variant thereof. 
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said at least one ligand of a BPI protein, BPI-related lipid binding protein, or 
fragment, analog or variant thereof. 

Other objects of the invention will be apparent to one of ordinary skill in the 
art from the following detailed description and examples relating to the present 
5 invention. 

Brief Description of the Figures 

Fig. 1 (A) A ribbon diagram of residues 1-456 of BPI illustrating its boomerang 
shape. The NH 2 -terminal domain is shown; the COOH -terminal domain and the two 
phosphatidylcholine molecules are shown. The linker is also shown, and the disulfide 
10 bond is shown as a ball-and-stick model. (B) View after rotating (A) 70° about the 
long axis of the molecule. Figure produced with MOLSCRIPT [P. Krauliz, J. Appl. 
Cryst., 24:926 (1991)] and RASTER3D [E. A. Merrit and M. E. P. Murphy, Acta 
Crystallogr., D50:8&9 (1994); D. J. Bacon and W. F. Anderson, /. Mo. Graphics, 
6:219 (1988)]. 

15 Rg. 2 (A) Schematic drawing of the novel BPI domain fold, shown in same 
orientation as the NH 2 -terminal domain in Fig, IB (B) Superposition of the NH 2 - and 
COOH-terminal domains of BPI showing the overall topological similarity. Residues 
1 to 230 and 250 to 456 are shown. The NH 2 -terminal domain is in the same 
orientation as Fig. 1A. 

20 Fig. 3 Electron density of the final 2.8 A MIR map contoured at 1.0 a and 
superimposed on the refined model. The area shown is in the lipid binding pocket 
of the NH 2 -terminal domain of BPI. The phosphatidylcholine and the surrounding 
protein atoms are shown. 

Fig. 4 (A) The covalent structure of phosphatidylcholine and the lipid A region of 
25 LPS from £. coli and S. typhimurium. Phosphate groups are indicated by P. 
Adapted with changes from [C. R. H. Raetz, Annu. Rev. Biochem, 59:129 (1990)]. 
(B) Slice through the interior of BPI showing the lipid binding pocket in the NH 2 - 
teiminal domain. The solvent accessible surface of the protein was calculated without 
lipid present, the interior of the protein and the phosphatidylcholine are shown. 
30 Protein residues are shown as ball-and-stick. 



WO 98/58961 PCT/US98/13007' 

15 

Figure produced with MSP [M. L. Connolly, Science, 227:709 (1983); M. L. 
Connolly, /. Am. Chem. Soc, 707:1118 (1985)]. 

Figs. 5 (A) and 5 (B) The amino acid sequences of human BPI, LBP, PLTP, and 
CETP. The alignment was performed with CLUSTAL [D. G. Higgins and P. M. 

5 Sharp, Gene, 75:237 (1989)] using all eleven known protein sequences from 
mammals [R. R. Schuman, et al., Science, 249:1429 (1990); D. Drayna et al., 
Nature, 527:632 (1987); R. Day et al, J. BioL Chem., 269:9388 (1994); S.R. Leong 
and T. Camerato, Nucleic Acids Res., 78:3052 (1990); M. Nagashima, J. W. 
McLean, R. M. Lawn, /. Lipid Res., 29:1643 (1988); M. E. Pape, E. F. Rehber, K. 

10 R. Marotti, G. W. Melchior, Artheriosclerosis 77:1759 (1991); G. Su et al., J. 
Immunol., 755:743 (1994); P. W. Gray et al., J. BioL Chem. 264: 9505 (1989); 
Albers et al., Biochem. Biophys. Acta, 1258:21 (1995); X. C. Jiang et al., 
Biochemistry, 54:7258 (1995); L. B. Agellon et al., Biochemistry, 29:1372 (1990); 
X. C. Jiang et al., /. Biol Chem., 266:4631 (1991)] but only the four human 

15 sequences are shown. Residues that are completely conserved in all proteins are 
indicated below the sequence *; those which are highly conserved are indicated by 
The secondary structure of BPI is indicated above the sequences. The 0 strands 
are indicated by arrows; strands which make up the central 0 sheet are shown with 
gray arrows. Because of the j3 bulges and pronounced twisting, some of the fi strands 

20 have one or more residues that do not show classical H-bonding patterns or $V 
angles; these breaks are indicated by A above the strands. The a helices are shown 
as cylinders, and one-residue breaks in helices B and B' are indicated with a vertical 
dashed line. The horizontal dashed line indicates the linker region. Peptides from 
BPI and LBP with the highest LPS-binding activity (Little, et al. , J. BioL Chem. 268: 

25 1865 (1994); Taylor et al., J. Biol. Chem. 270: 17934 (1995)) are in bold italics. 
The disulfide bond is indicated by S-S. Residues with atoms within 4 A of the NH 2 - 
terminai lipid are highlighted with gray shading; residues within 4 A of the COOH- 
terminal lipid are shown with white letters in black boxes. 

Kg. 6 Block diagram of a computer system 102 that can be used to implement the 
30 present invention. The computer system 102 includes a processor 106 connected to 
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a bus 104. Also connected to the bus 104 are a main memory 108 (preferably 
implemented as random access memory, RAM) and a variety of secondary storage 
memory 110, such as a hard drive 112, a removable medium storage device 114, a 
command device 118, and a visualization device, 120. Also included is a removable 
5 storage medium 116. 

Detailed Description 
The present invention provides methods for crystallizing a BPI protein product 
where the crystals diffract x-rays with sufficiently high resolution to allow 
determination of the three-dimensional structure of the BPI protein product, including 

10 atomic coordinates. The three-dimensional structure (e.g, as provided on computer 
readable media as described herein) is useful for rational drug design of BPI-related 
(and LBP-, CETP-, PLTP-related) mimetics and/or ligands. Specifically provided is 
a method for crystallizing a recombinant non-glycosylated human BPI analog 
holoprotein comprising a 456 amino acid sequence wherein the amino acid serine at 

15 position 351 has been changed to alanine. The three-dimensional structure is useful 
for modeling and/or synthesizing BPI-related mimetics or ligands. Such BPI-related 
mimetics or ligands are useful for treating, inhibiting or preventing BPI-moduIated 
diseases. 

The present invention thus includes methods of expressing, purifying and 
20 crystallizing a BPI protein product from suitable sources, such as eukaryotic cells or 
tissues. The present invention also provides crystallized BPI protein products by 
these methods. The crystallized BPI is analyzed by x-ray diffraction techniques to 
obtain high resolution diffraction patterns and atomic coordinates that are suitable for 
molecular modeling. 

25 As used herein, "BPI protein product" or n BPI protein" includes naturally and 

recombinantly produced BPI protein; natural, synthetic, and recombinant biologically 
active polypeptide fragments of BPI protein; biologically active polypeptide variants 
of BPI protein or fragments thereof, including hybrid fusion proteins and dimers; 
biologically active polypeptide analogs of BPI protein or fragments or variants 

30 thereof, including cysteine-substituted analogs; and BPI-derived peptides. The BPI 
protein products for therapeutic or diagnostic uses may be generated and/or isolated 
by any means known in the art. U.S. Patent No. 5,198,541, the disclosure of which 
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is incorporated herein by reference, discloses recombinant genes encoding and 
methods for expression of BPI proteins including recombinant BPI holoprotein, 
referred to as rBPI (also referred to as rBPI 55 or simply rBPI^) and recombinant 
fragments of BPI. U.S. Patent Application Ser. No. 07/885,501, now abandoned, 

5 and a continuation-in-part thereof, U.S. Patent Application Ser. No. 08/072,063, filed 
May 19, 1993, issued as U.S. Patent 5,439,807 on August 8, 1995 and corresponding 
PCT Application No. 93/04752 filed May 19, 1993, which are all incorporated herein 
by reference, disclose novel methods for the purification of recombinant BPI protein 
products expressed in and secreted from genetically transformed mammalian host cells 

10 in culture and discloses how one may produce large quantities of recombinant BPI 
products suitable for incorporation into stable, homogeneous pharmaceutical 
preparations. 

Biologically active fragments of BPI (BPI fragments) include biologically 
active molecules that have the same or similar amino acid sequence as a natural 

15 human BPI holoprotein, except that the fragment molecule lacks amino-terminal 
amino acids, internal amino acids, and/or carboxy-terminal amino acids of the 
holoprotein. Nonlimiting examples of such fragments include a N-terminal fragment 
of natural human BPI of approximately 25 kD, described in Ooi et al., 7. Exp. Med., 
174:649 (1991), and the recombinant expression product of DNA encoding N- 

20 terminal amino acids from 1 to about 193 or 199 of natural human BPI, described in 
Gazzano-Santoro et al., Infect. Immun. 60:4754-4761 (1992), and referred to as 
rBPI^. In that publication, an expression vector was used as a source of DNA 
encoding a recombinant expression product (rBPI^) having the 31 -residue signal 
sequence and the first 199 amino acids of the N-tenninus of the mature human BPI, 

25 as set out in Figure 1 of Gray et ah, supra, except that valine at position 151 is 
specified by GTG rather than GTC and residue 185 is glutamic acid (specified by 
GAG) rather than lysine (specified by AAG). Recombinant holoprotein (rBPI) has 
also been produced having the sequence (SEQ ID NOS: 1 and 2) set out in Figure 1 
of Gray et al., supra, with the exceptions noted for rBPI 23 and with the exception that 

30 residue 417 is alanine (specified by GCT) rather than valine (specified by GTT). 
Other examples include dimeric forms of BPI fragments, as described in U.S. Patent 
No. 5,447,913, and corresponding PCT Application No. PCT/US95/03125, the 
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disclosures of which are incorporated herein by reference. Preferred dimeric 
products include dimeric BPI protein products wherein the monomers are amino- 
terminal BPI fragments having the N-terminal residues from about 1 to 175 to about 
1. to 199 of BPI holoprotein, A particularly preferred dimeric product is the dimeric 
5 form of the BPI fragment having N-terminal residues 1 through 193, designated 
rBPI 42 dimer. 

Biologically active variants of BPI (BPI variants) include but are not limited 
to recombinant hybrid fusion proteins, comprising BPI holoprotein or biologically 
active fragment thereof and at least a portion of at least one other polypeptide, and 

10 dimeric forms of BPI variants. Examples of such hybrid fusion proteins and dimeric 
forms are described by Theofan et al. in U.S. Patent Application Serial No. 
07/885,911, now abandoned, and a continuation-in-part application thereof, U.S. 
Patent Application Serial No. 08/064,693 filed May 19, 1993, issued as U.S. Patent 
5,643,570 on July 1 , 1997 and corresponding PCT Application No. US93/04754 filed 

15 May 19, 1993, which are all incorporated herein by reference and include hybrid 
fusion proteins comprising, at the amino-terminal end, a BPI protein or a biologically 
active fragment thereof and, at the carboxy -terminal end, at least one constant domain 
of an immunoglobulin heavy chain or allelic variant thereof. Similarly configured 
hybrid fusion proteins involving part or all Lipopolysaccharide Binding Protein (LBP) 

20 are also contemplated for use in the present invention. 

Biologically active analogs of BPI (BPI analogs) include but are not limited 
to BPI protein products wherein one or more amino acid residues have been replaced 
by a different amino acid. For example, U.S. Patent No. 5,420,019 and 
corresponding PCT Application No. US94/01235, filed February 2, 1994, the 

25 disclosures of which are incorporated herein by reference, discloses polypeptide 
analogs of BPI and BPI fragments wherein a cysteine residue is replaced by a 
different amino acid. A preferred BPI protein product described by this application 
is the expression pfroduct of DNA encoding from amino acid 1 to approximately 193 
or 199 of the N-terminal amino acids of BPI holoprotein, but wherein the cysteine at 

30 residue number 132 is substituted with alanine and is designated rBPI 21 Acys or rBPI 2 i . 
Other examples include dimeric forms of BPI analogs; e.g. U.S. Patent Application 
Serial No. 08/212,132 filed March 11, 1994, issued as U.S. Patent 5,447,913 on 
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September 5, 1995 and corresponding PCT Application No. PCT/US95/03125, the 
disclosures of which are incorporated herein by reference. 

Other BPI protein products useful according to the methods of the invention 
are peptides derived from or based on BPI produced by recombinant or synthetic 

5 means (BPI-derived peptides), such as those described in U.S. Patent Application 
Serial No. 08/504,841 filed July 20, 1995 and in PCT Application No. 
PCT/US94/ 10427 filed September 15, 1994, which corresponds to U.S. Patent 
Application Serial No. 08/306,473 filed September 15, 1994, issued as U.S. Patent 
5,652,332 on July 29, 1997, and PCT Application No. US94/02465 filed March 11, 

10 1994, which corresponds to U.S. Patent Application Serial No. 08/209,762, filed 
March 11, 1994, issued as U.S. Patent 5,733,872 on March 31, 1998, which is a 
continuation-in-part of U.S. Patent Application Serial No. 08/183,222, filed January 
14, 1994, now abandoned, which is a continuation-in-part of U.S. Patent Application 
Ser. No. 08/093,202 filed July 15, 1993, now abandoned, (for which the 

15 corresponding international application is PCT Application No. US94/02401 filed 
March 11, 1994), which is a continuation-in-part of U.S. Patent Application Ser. No. 
08/030,644 filed March 12, 1993, issued as U.S. Patent 5,348,942 on September 20, 
1994, the disclosures of all of which are incorporated herein by reference. 

Presently preferred BPI protein products include recombinantly-produced N- 

20 terminal fragments of BPI, especially those having a molecular weight of 
approximately between 21 to 25 kD such as rBPI^ or rBPI 2 i, or dimeric forms of 
these N-terminal fragments (e.g., rBPI 42 dimer). Additionally, preferred BPI protein 
products include rBPI 50 and BPI-derived peptides. 

The administration of BPI protein products is preferably accomplished with 

25 a pharmaceutical composition comprising a BPI protein product and a 
pharmaceutical^ acceptable diluent, adjuvant, or carrier. The BPI protein product 
may be administered without or in conjunction with known surfactants, other 
chemotherapeutic agents or additional known anti-microbial agents. One 
pharmaceutical composition containing BPI protein products (e.g., rBPI 50 , rRBPI 23 ) 

30 comprises the BPI protein product at a concentration of 1 mg/ml in citrate buffered 
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saline (5 or 20 mM citrate, 150 mM NaCl, pH 5.0) comprising 0.1% by weight of 
poloxamer 188 (Pluronic F-68, BASF Wyandotte, Parsippany, NJ) and 0.002% by 
weight of polysorbate 80 (Tween 80, ICI Americas Inc., Wilmington, DE). Another 
pharmaceutical composition containing BPI protein products (e.g., rBPI 21 ) comprises 
5 the BPI protein product at a concentration of 2 mg/mL in 5 mM citrate, 
150 mM NaCl, 0.2% poloxamer 188 and 0.002% polysorbate 80. Such combinations 
. are described in PCT Application No. US94/01239 filed February 2, 1994, which 
corresponds to U.S. Patent Application Ser. No. 08/190,869 filed February 2, 1994, 
issued as U.S. Patent 5,488,034 on January 30, 1996, and U.S. Patent Application 

10 Ser. No. 08/012,360 filed February 2, 1993, now abandoned, the disclosures of all 
of which are incorporated herein by reference. Additional formulations are provided 
in U.S. Patent Application Ser. Nos. 08/372,104, filed January 13, 1995, now 
abandoned, 08/530,599, filed September 19, 1995, now abandoned, and 08/586,133, 
filed January 12, 1996 and corresponding W096/21436 (PCT/US96/01095). 

15 The x-ray diffraction patterns of the invention are now discovered to be of 

sufficiently high resolution to be useful for three-dimensional modeling of a BPI. 
Preferably the resolution is in the range of 1.5 to 3.5 A, preferably 1.5-3.0 A and 
more preferably < 2.6 A. 

Three-dimensional modeling is performed using the diffraction coordinates 

20 from these x-ray diffraction patterns. The coordinates are entered into one or more 
computer programs for molecular modeling, as known in the art. Such molecular 
modeling can utilize known x-ray diffraction molecular modeling algorithms or 
molecular modeling software to generate atomic coordinates corresponding to the 
three-dimensional structure of at least one BPI or a fragment thereof. 

25 The entry of the coordinates of the x-ray diffraction patterns and the amino 

acid sequence into such programs results in the calculation of most probable 
secondary, tertiary and quaternary structures of the protein, including overall atomic 
coordinates of a BPI or a fragment thereof. These structures are combined and 
refined by additional calculations using such programs to determine the probable or 

30 actual three-dimensional structure of the BPI, including potential or actual active or 
binding sites of the protein. 
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Such molecular modeling (and related) programs useful for rational drug 
design of ligands or mimetics, are also provided by the present invention. The drug 
design uses computer modeling programs which calculate how different molecules 
interact with the various sites of the BPI. This procedure determines potential ligands 

5 or mimetics of a BPI or at least one fragment thereof. The actual BPI-ligand 
complexes or mimetics are crystallized and analyzed using x-ray diffraction. The 
diffraction pattern coordinates are similarly used to calculate the three-dimensional 
interaction of a ligand and the BPI or a mimetic, in order to confirm that the ligand 
binds to, or changes the conformation of, a particular site on the BPI, or where the 

10 mimetic has a similar three-dimensional structure to that of a BPI or a fragment 
thereof. 

The potential ligands or mimetics are then screened for activity relating to a 
BPI. Such screening methods are selected from assays for at least one biological 
activity of the native BPI. 

15 The resulting ligands or mimetics, provided by methods of the present 

invention, are useful for treating, screening or preventing bacterial infections in 
animals, such as mammals (including humans) and birds. Mimetics or ligands of a 
particular BPI will similarly react with other BPIs from other species, subgenera or 
genera of the BPI source organism. 

20 Also provided are biologically active BPI proteins. A BPI protein is also 

provided as a crystallized protein suitable for x-ray diffraction analysis. The x-ray 
diffraction patterns obtained by the x-ray analysis are of moderately high to high 
resolution, e.g., 1.5-3.5 A. The coordinates from these diffraction patterns are 
suitable and useful for three-dimensional modeling of the crystallized protein. 

25 During the three-dimensional modeling of the BPI, these coordinates are 

entered with the BPI amino acid sequence into computer modeling programs to 
generate secondary, tertiary and quaternary structures of the BPI, as atomic 
coordinates. These structures together provide the three-dimensional structure of the 
BPI. The calculated and confirmed three-dimensional structure is then used for 

30 rational drug design of ligands or mimetics of the BPI or a fragment thereof. 

The determination of the three-dimensional structure of a BPI protein thus has 
a broad-based utility. Significant sequence identity and conservation of important 
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structural elements is expected to exist among the BPI proteins of a particular species, 
subgenus, genus, or family. Therefore, the three-dimensional structure from one or 
a few BPI proteins can be used to identify therapeutics with one or more of the 
biological activities of BPI (and/or those of related proteins such as LBP, CETP and 

5 PLTP). 

Determination of Protein Structures 

Different techniques give different and complementary information about 
protein structure. The primary structure is obtained by biochemical methods, either 
by direct determination of the amino acid sequence from the protein, or from the 

10 nucleotide sequence of the corresponding gene or cDNA. The quaternary structure 
of large proteins or aggregates can also be determined by electron microscopy. To 
obtain the secondary and tertiary structure, which requires detailed information about 
the arrangement of atoms within a protein, x-ray crystallography is preferred. 

The first prerequisite for solving the three-dimensional structure of a protein 

15 by x-ray crystallography is a well-ordered crystal that will diffract x-rays strongly. 
The crystallographic method directs a beam of x-rays onto a regular, repeating array 
of many identical molecules so that the x-rays are diffracted from it in a pattern from 
which the structure of an individual molecule can be retrieved. Well-ordered crystals 
of globular protein molecules are large, spherical, or ellipsoidal objects with irregular 

20 surfaces, and crystals thereof contain large holes or channels that are formed between 
the individual molecules. These channels, which usually occupy more than half the 
volume of the crystal, are filled with disordered solvent molecules. The protein 
molecules are in contact with each other at only a few small regions. This is one 
reason why structures of proteins determined by x-ray crystallography are generally 

25 the same as those for the proteins in solution. 

The formation of crystals is dependent on a number of different parameters, 
including pH, temperature, protein, concentration, the nature of the solvent and 
precipitant, as well as the presence of added ions or ligands to the protein. Many 
routine crystallization experiments may be needed to screen all these parameters for 

30 the few combinations that might give crystal suitable for x-ray diffraction analysis. 
Crystallization robots can automate and speed up the work of reproducibly setting up 
large number of crystallization experiments. 
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A pure and homogeneous protein sample is important for successful 
crystallization. Proteins obtained from cloned genes in efficient expression vectors 
can be purified quickly to homogeneity in large quantities in a few purification steps. 
A protein to be crystallized is preferably at least 93-99% pure according to standard 

5 criteria of homogeneity. Crystals form when molecules are precipitated very slowly 
from supersaturated solutions. The most frequently used procedure for making 
protein crystals is the hanging-drop method, in which a drop of protein solution is 
brought very gradually to supersaturation by loss of water from the droplet to the 
larger reservoir that contains salt or polyethylene glycol solution. 

10 Different crystal forms can be more or less well-ordered and hence give 

diffraction patterns of different quality. As a general rule, the more closely the 
protein molecules pack, and consequently the less water the crystals contain, the 
better is the diffraction pattern because the molecules are better ordered in the crystal. 
X-rays are electromagnetic radiation at short wavelengths, emitted when 

15 electrons jump from a higher to a lower energy state. In conventional sources in the 
laboratory, x-rays are produced by high-voltage tubes in which a metal plate, the 
anode, is bombarded with accelerating electrons and thereby caused to emit x-rays of 
a specific wavelength, so-called monochromatic x-rays. The high voltage rapidly 
heats up the metal plate, which therefore has to be cooled. Efficient cooling is 

20 achieved by so-called rotating anode x-ray generators, where the metal plate revolves 
during the experiment so that different parts are heated up. 

More powerful x-ray beams can be produced in synchrotron storage rings 
where electrons (or positrons) travel close to the speed of light. These particles emit 
very strong radiation at all wavelengths from short gamma rays to visible light. 

25 When used as an x-ray source, only radiation within a window of suitable 
wavelengths is channeled from the storage ring. Polychromatic x-ray beams are 
produced by having a broad window that allows through x-ray radiation with 
wavelengths of 0.2-3.5 A. 

In diffraction experiments a narrow and parallel beam of x-rays is taken out 

30 from the x-ray source and directed onto the crystal to produce diffracted beams. The 
incident primary beam causes damage to both protein and solvent molecules. The 
crystal is, therefore, usually cooled to prolong its lifetime (e.g., -220 to -50°C). The 
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primary beam must strike the crystal from many different directions to produce all 
possible diffraction spots, and so the crystal is rotated in the beam during the 
experiment. 

The diffracted spots are recorded either on a film, the classical method, or by 

5 an electronic detector. The exposed film has to be measured and digitized by a 
scanning device, whereas electronic detectors feed the signals they detect directly in 
a digitized form into a computer. Electronic area detectors (an electronic film) 
significantly reduce the time required to collect and measure diffraction data. 

When the primary beam from an x-ray source strikes the crystal, some of the 

10 x-rays interact with the electrons on each atom and cause them to oscillate. The 
oscillating electrons serve as a new source of x-rays, which are emitted in almost all 
directions, referred to as scattering. When atoms (and hence their electrons) are 
arranged in a regular three-dimensional array, as in a crystal, the x-rays emitted from 
the oscillating electrons interfere with one another. In most cases, these x-rays, 

15 colliding from different directions, cancel each other out; those from certain 
directions, however, will add together to produce diffracted beams of radiation that 
can be recorded as a pattern on a photographic plate or detector. 

The diffraction pattern obtained in an x-ray experiment is related to the crystal 
that caused the diffraction. X-rays that are reflected from adjacent planes travel 

20 different distances, and diffraction only occurs when the difference in distance is 
equal to the wavelength of the x-ray beam. This distance is dependent on the 
reflection angle, which is equal to the angle between the primary beam and the 
planes. 

The relationship between the reflection angle (9), the distance between the 
25 planes (d), and the wavelength (X) is given by Bragg's law: 2d sin 9 = X. This 
relation can be used to determine the size of the unit cell in the crystal. Briefly, the 
position on the film of the diffraction data relates each spot to a specific set of planes 
through the crystal. By using Bragg's law, these positions can be used to determine 
the size of the unit call. 
30 Each atom in a crystal scatters x-rays in all directions, and only those that 

positively interfere with one another, according to Bragg's law, give rise to diffracted 
beams that can be recorded as a distinct diffraction spot above background. Each 
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diffraction spot is the result of interference of all x-rays with the same diffraction 
angle emerging from all atoms. For example, for the protein crystal of myoglobin, 
each of the about 20,000 diffracted beams that have been measured contain scattered 
x-rays from each of the around 1500 atoms in the molecule. To extract information 
5 about individual atoms from such a system requires considerable computation. The 
mathematical tool that is used to handle such problems is called the Fourier 
transform. 

Each diffracted beam, which is recorded as a spot on the film, is defined by 
three properties: the amplitude, which we can measure from the intensity of the spot; 

10 the wavelength, which is set by the x-ray source; and the phase, which is lost in x-ray 
experiments. All three properties are needed for all of the diffracted beams, in order 
to determine the position of the atoms giving rise to the diffracted beams. 

For larger molecules, protein crystallographers have determined the phases in 
many cases using a method called multiple isomorphous replacement (MIR) (including 

15 heavy metal scattering), which requires the introduction of new x-ray scatterers into 
the unit cell of the crystal. These additions are usually heavy atoms (so that they 
make a significant contribution to the diffraction pattern), such that there should not 
be too many of them (so that their positions can be located); and they should not 
change the structure of the molecule or of the crystal cell, i.e., the crystals should be 

20 isomorphous. Isomorphous replacement is usually done by diffusing different heavy- 
metal complexes into the channels of the preformed protein crystals. The protein 
molecules expose side chains (such as SH groups) into these solvent channels that are 
able to bind heavy metals. It is also possible to replace endogenous light metals in 
metalloproteins with heavier ones, e.g., zinc by mercury, or calcium by samarium. 

25 Since such heavy metals contain many more electrons than the light atoms 

(H,N,C,0 and S) of the protein, they scatter x-rays more strongly. All diffracted 
beams would therefore increase in intensity after heavy-metal substitution if all 
interference were positive. In fact, however, some interference is negative; 
consequently, following heavy-metal substitution, some spots measurably increase in 

30 intensity, others decrease, and many show no detectable difference. 

Phase differences between diffracted spots can be determined from intensity 
changes following heavy-metal substitution. First, the intensity differences are used 
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to deduce the positions of the heavy atoms in the crystal unit cell. Fourier 
summations of these intensity differences give maps of the vectors between the heavy 
atoms, the so-called Patterson maps. From these vector maps the atomic arrangement 
of the heavy atoms is deduced. From the positions of the heavy metals in the unit 
5 cell, one can calculate the amplitudes and phases of their contribution to the diffracted 
beams of protein crystals containing heavy metals. 

This knowledge is then used to find the phase of the contribution from the 
protein in the absence of the heavy-metal atoms. As both the phase and amplitude 
of the heavy metals and the amplitude of the protein alone is known, as well as the 

10 amplitude of the protein plus heavy metals (i.e., protein heavy-metal complex), one 
phase and three amplitudes are known. From this, the interference of the x-rays 
scattered by the heavy metals and protein can be calculated to see if it is constructive 
or destructive. The extent of positive or negative interference, with knowledge of the 
phase of the heavy metal, given an estimate of the phase of the protein. Because two 

15 different phase angles are determined and are equally good solutions, a second heavy- 
metal complex can be used which also gives two possible phase angles. Only one of 
these will have the same value as one of the two previous phase angles; it therefore 
represents the correct phase angle. In practice, more than two different heavy-metal 
complexes are usually made in order to give a reasonably good phase determination 

20 for all reflections. Each individual phase estimate contains experimental errors 
arising from errors in the measured amplitudes. Furthermore, for many reflections, 
the intensity differences are too small to measure after one particular isomorphous 
replacement, and others can be tried. 

The amplitudes and the phases of the diffraction data from the protein crystals 

25 are used to calculate an electron-density map of the repeating unit of the crystal. This 
map then has to be interpreted as a polypeptide chain with a particular amino acid 
sequence. The interpretation of the electron-density map is made more complex by 
several limitations of the data. First of all, the map itself contains errors, mainly due 
to errors in the phase angles. In addition, the quality of the map depends on the 

30 resolution of the diffraction data, which in turn depends on how well-ordered the 
crystals are. This directly influences the image that can be produced. The resolution 
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is measured in A units; the smaller this number is, the higher the resolution and 
therefore the greater the amount of detail that can be seen. 

Building the initial model is a trial-and-error process. First, one has to decide 
how the polypeptide chain weaves its way through the electron-density map. The 

5 resulting chain trace constitutes a hypothesis, by which one tries to match the density 
of the side chains to the known sequence of the polypeptide. When a reasonable 
chain trace has finally been obtained, an initial model is built to give the best fit of 
the atoms to the electron density. Computer graphics are used both for chain tracing 
and for model building to present the data and manipulated the models. 

10 The initial model will contain some errors. Provided the protein crystals 

diffract to high enough resolution (e.g., better than 3.5 A), most or substantially all 
of the errors can be removed by crystallographic refinement of the model using 
computer algorithms. In this process, the model is changed to minimize the 
difference between the experimentally observed diffraction amplitudes and those 

15 calculated for a hypothetical crystal containing the model (instead of the real 
molecule). This difference is expressed as an R factor (residual disagreement) which 
is 0.0 for exact agreement and about 0.59 for total disagreement. 

In general, the R factor is preferably between 0.15 and 0.35 (such as less than 
about 0.24-0.28) for a well-determined protein structure. The residual difference is 

20 a consequence of errors and imperfections in the data. These derive from various 
sources, including slight variations in the conformation of the protein molecules, as 
well as inaccurate corrections both for the presence of solvent and for differences in 
the orientation of the microcrystals from which the crystal is built. This means that 
the final model represents an average of molecules that are slightly different both in 

25 conformation and orientation. 

In refined structures at high resolution, there are usually no major errors in 
the orientation of individual residues, and the estimated errors in atomic positions are 
usually around 0.1-0.2 A, provided the amino acid sequence is known. Hydrogen 
bonds, both within the protein and to bound ligands, can be identified with a high 

30 degree of confidence. 

Those of skill in the art understand that a set of structure coordinates 
determined by X-ray crystallography is not without standard error. For the purpose 
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of this invention, any set of structure coordinates for BPI-protein that have a root 
mean square deviation of protein backbone atoms (N, Co:, C and O) of less that 
0.75 A when superimposed — using backbone atoms — on the structure coordinates 
listed in Table 4 shall be considered identical. 
5 Most x-ray structures are determined to a resolution between 1.7 A and 3.5 

A. Electron-density maps with this resolution range are preferably interpreted by 
fitting the known amino acid sequences into regions of electron density in which 
individual atoms are not resolved. 

An amino acid sequence is preferred for accurate x-ray structure 

10 determination. Thus, recombinant DNA techniques have had a double impact on x- 
ray structural work. When a protein is cloned and overexpressed for structural 
studies, the amino acid sequence, necessary for the x-ray work, is also quickly 
obtained via the nucleotide sequence. Recombinant DNA techniques give us not only 
abundant supplies of rare proteins, but also their amino acid sequence as a bonus. 

15 Overview of BPI Purification and Crystallization Methods 

In general, a BPI protein is purified as described in Example 1. The resulting 
BPI is in sufficient purity and concentration for crystallization. The BPI is then 
isolated and assayed for biological activity and for lack of aggregation (which 
interferes with crystallization). The purified BPI preferably runs as a single band 

20 under reducing or nonreducing polyacrylamide gel electrophoresis (PAGE) 
(nonreducing is used to evaluate the presence of cysteine bridges). 

The purified BPI is preferably crystallized using the hanging drop method 
under varying conditions of at least one of the following: pH, buffer type, buffer 
concentration, salt type, polymer type, polymer concentration, other precipitating 

25 agents and concentration of purified and cleaved BPI. See, e.g., the methods 
provided in a commercial kit, such as CRYSTAL SCREEN (Hampton Research, 
Riverside, CA). Differently sized and shaped crystals are tested for suitability for x- 
ray diffraction. Generally, larger crystals provide better crystallography than smaller 
crystals, and thicker crystals provide better diffraction than thinner crystals. 

30 Purified BPIs 

The results of the purification are optionally analyzed by polyacrylamide gel 
electrophoresis (PAGE) under reducing or non-reducing conditions. A single band 
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is preferably obtained. With disulfide-containing BPIs, it is preferred that the analysis 
of the cleaved BPI be under non-reducing conditions to indicate whether the cleaved 
protein formed disulfide linked dimers* The amino acid sequence can also be 
determined according to known methods, or otherwise obtained, as this sequence is 

5 important in determining the three-dimensional structure of the cleaved protein (in 
combination with crystallographic analysis), as described herein, using molecular 
modeling techniques. 

Before crystallization, the purified protein is tested for one or more of the 
known biological activities of a BPI protein. 

10 It is preferred that the biological activity exceed the activity of the native 

protein. The preferred result indicates that the BPI protein retains its native structure, 
which is important for determining the three-dimensional crystal structure of the 
biologically active molecule. To identify the protease cleavage site, the purified and 
cleaved protein can be sequenced using known techniques. See, e.g., Murti et al., 

15 Proc. Natl Acad. Sci. USA 90:1523-1525 (1993); Takimoto et al (1992), infra, 
entirely incorporated herein by reference. 
Protein Crystallization Methods 

The hanging drop method is preferably used to crystallize the purified protein. 
See, e.g., Taylor et al,J. Mol Biol 226:1287-1290 (1992); Takimoto et al (1992), 

20 infra; CRYSTAL SCREEN, Hampton Research. 

A mixture of the purified protein and precipitant can include the following: 
pH (*.£., 4-9); 

• buffer type (e.g. , phosphate, cacodylate, acetates, imidazole, Tris HC1, 
sodium HEPES); 

25 • buffer concentration (e.g., 10-200 mM); 

• salt type (e.g., calcium chloride, sodium citrate, magnesium chloride, 
ammonium acetate, ammonium sulfate, potassium phosphate, 
magnesium acetate, zinc acetate; calcium acetate) 

• polymer type and concentration: (e.g., polyethylene glycol (PEG) 
30 1-50%, average molecular weight 200-10,000); 
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• other precipitating agents (salts: K, Na tartrate, NH 4 S0 4 , NaAc, 
LiS0 4> NaFormate, NaCitrate, MgFormate, NaP0 4 , KP0 4 NH 4 P0 4 ; 
organics; 2-propanol; non-volatile: 2-methyl-2,4-pentanediol); and 

• concentration of purified BPI (e.g., 1.0-100 mg/ml). 
5 See, e.g., CRYSTAL SCREEN, Hampton Research. 

A non-limiting example of such crystallization conditions is the following: 

• purified protein (e.g., approximately 3-4 mg/ml); 
H 2 0; 

• precipitant 10-14% Polyethylene glycol (PEG) 8000 buffered with 100 
10 mM cacodylate buffer and 200 mM of Mg acetate; 

• at an overall pH of about 3.5-8.5. 

The above mixtures are used and screened by varying at least one of pH, 
buffer type; buffer concentration, precipitating salt type or concentration, PEG type, 
PEG concentration, and protein concentration. Crystals ranging in size from 0.2-0.7 
15 mm are formed in 1 -7 days. These crystals diffract x-rays to at least 3.5 A 
resolution, such as 1.5 -3.5 A, or any range of value therein, such as 1.5, 1.6, 1.7, 
1.8, 1.9, 2.0, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, or 3.0, with 3.0 A or less 
being preferred. 
Protein Crystals 

20 Crystals appear after 1-4 days and grow to maximal size within a week. From 

one ten crystals are observed in one drop and crystal forms can occur, such as, but 
not limited to, bipyramidal, rhomboid, and cubic. Initial x-ray analyses indicate that 
such crystals diffract at moderately high to high resolution. When fewer crystals are 
produced in a drop, they can be much larger size, e.g. y 0.4-0.9 mm. 

25 X-ray Crystallography Methods and Molecular Modeling a 

The crystals so produced for BPI are x-ray analyzed using a suitable x-ray 
source. Diffraction patterns are obtained. Crystals are preferably stable for at least 
10 hrs in the x-ray beam. Frozen crystals (e.g., -220 to -50°C) could also be used 
for longer x-ray exposures (e.g. , 24-72 hrs), the crystals being relatively more stable 

30 to the x-rays in the frozen state. To collect the maximum number of useful 
reflections, multiple frames are optionally collected as the crystal is rotated in the x- 
ray beam, e.g. , for 24-72 hrs. Larger crystals ( > 0.2 mm) are preferred, to increase 
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the resolution of the x-ray diffraction. Alternatively, crystals may be analyzed using 
a synchrotron high energy x-ray source. Using frozen crystals, x-ray diffraction data 
is collected on crystals that diffract to a relatively high resolution of 3.5 A or less, 
sufficient to solve the three-dimensional structure of BPI in considerable detail, as 

5 presented herein. 

Native and/or derivative x-ray diffraction data with medium resolution is 
collected on area detectors mounted on rotating anode x-ray sources. The alternative 
program DENZO is preferably used for data processing and reduction (Sawyer et al , 
eds., Proceedings of CCP4 Study Weekend, pp. 56-62, SERC Darsbary Lab., UK 

10 (1993)). 

The resolution is optionally improved using larger crystals, e.g., 0.2mm, 
making data collection more efficient, particularly for the determination of suitable 
heavy metal derivatives, such as Hg, Pt, Pb, Ba, Cd, and/or La derivatives. 

The heavy metal derivatives are used to determine the phase, e.g., by the 

15 isomorphous replacement method. Heavy atom isomorphous derivatives of BPI are 
used for x-ray crystallography, where the structure is solved using one or several 
derivatives, which, (when combined) improves the overall figure of merit. 
Derivatives are identified through Patterson maps and/or cross-phase difference 
Fourier maps, e.g., using the CCP4 package (SERC Collaborative Computing Project 

20 No. 4, Daresbury Laboratory, UK, 1979). 

Phases were also obtained or improved by optimization of the anomalous 
dispersion component of the x-ray scattering which can break the phase ambiguity 
which a single heavy atom derivative gives. In certain cases phase information may 
be obtained without the need of a native set of data, through the use of multiple 

25 wavelength with anomalous dispersion phasing (MAD phasing). The wavelength of 
the x-rays used may be selected at a synchrotron source to optimize this anomalous 
scattering. In this case data from a derivatised crystal or crystals is collected at 
typically three wavelengths, two of which are very close to the absorption edge of the 
heavy atom scatterer. One way of obtaining a suitable heavy atom derivatised crystal 

30 is to derivatise a known ligand of the protein. 

The program MLPHARE (Wolf et aL, eds., Isomorphous Replacement and 
Anomalous Scattering: Proceedings of CCP4 Study Weekend, pp. 80-86, SERC 
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Daresbury Lab., UK (1991)) is optionally used for refinement of the heavy atom 
parameters and the phases derived from them by comparing at least one of 
completeness (%), resolution (A), R r (%), heavy atom concentration (mM), soaking 
time, heavy atom sites, phasing power (acentric, centric) (See Table 1 as an 
5 analogous example from The Crystal Structure of diphtheria toxin, Choe et al., 
Nature 357: 216-222 (1992), Addition of heavy atom derivatives produce an MIR 
map with recognizable features. 

The initial phases are calculated to 3.2 A, and then improved and extended 
to a higher resolution of 2.8 A (e.g., <3.0 A) using solvent flattening, histogram 

10 matching and/or Sayre's equation in the program DM (Cowtan and Main, Acta 
Crystallogr. D 49:148-157 (1993)). The skeletonization of DM procedure is 
optionally used to improve connectivity in the bulk of the protein envelope. Both the 
MIR and density modified maps are optionally used in subsequent stages, to provide 
sufficient resolution and/or modeling of surface structures. 

15 Skeletonized representations of electron density maps are then computed. 

These maps are automatically or manually edited using suitable software, e.g., the 
graphics package FRODO (Jones et al. (1991), infra) to give a continuous Ca trace. 
The BPI sequence is then aligned to the trace. Initially pieces of idealized 
polypeptide backbone were placed into regions of the electron density map with 

20 obvious secondary structures (e.g. , a-helix, jS-sheet). After a polyalanine model was 
constructed for the protein, amino acid sidechains were added where density was 
present in the maps. The amino acid sequence of BPI was then examined for regions 
with distinct sidechain patterns (e.g., three consecutive aromatic rings). When a 
pattern in the sequence was found to match an area of the map, the correct sidechains 

25 were built onto the existing model. Eventually fragments containing recognizable 
sequence motifs were connected into a single chain, completing the tracing of the 
amino acid sequence into the maps. 

X-ray diffraction data (e.g. , to < 3 .0 A) was collected on an RAXIS 1 1C area 
detector (e.g., a Mar imaging plate) mounted on a RIGAKU rotating anode or 

30 alternatively a synchrotron x-ray source,, and processed using a suitable oscillation 
data reduction program (DENZO, Sawyer et al. eds., Proceedings of CCP4 Study 
Weekend, pp. 56-62, SERC Darsbary Lab., UK (1993). Cycles of simulated 
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annealing against these data were refined using the program X-PLOR for molecular 
dynamics for R-factor refinement (X-PLOR, Brunger et al, /. Mol Biol 205:803- 
816 (1987)). This refinement was followed by manual rebuilding with FRODO using 
experimental and 2F 0 -F C maps. The model can be optionally further refined using a 

5 least-squares refinement program, such as TNT (Tronrud et aL , Acta Crystallogr. A 
45:489-501 (1987)). 

One or more of the above modeling steps is performed to provide a molecular 
3-D model of BPL It is preferred that the BP1 model has no residues in disallowed 
regions of the Ramachandran plot, and gives a positive 3D- ID profile (Luthy et al, 

10 Nature 556:83-85 (1992)), suggesting that all the residues are in acceptable 
environments (Kraulis (1991), infra). 

Multiple isomorphous replacement phase determination was used for solving 
the three-dimensional structure of BPL This structure is then used for rational drug 
design of BPI ligands or mimetics of at least one BPI bactericidal activity, or other 

15 biological activity important in inactivating bacterial toxicity, replication and/or 
infection. 

Computer Related Embodiments 

An amino acid sequence of a BPI protein (or related protein such as LBP, 
CETP or PLTP) and/or x-ray diffraction data, useful for computer molecular 

20 modeling of BPI protein (or related protein such as LBP, CETP or PLTP) or a 
portion thereof, can be "provided" in a variety of mediums to facilitate use thereof. 
As used herein, provided refers to a manufacture, which contains, for example, a BPI 
amino acid sequence and/or atomic coordinate/x-ray diffraction data of the present 
invention, e.g., an amino acid sequence provided in Figure 5, a representative 

25 fragment thereof, or an amino acid sequence having at least 80-100% overall identity 
to an amino acid fragment of an amino acid sequence of Figure 5 or a variant thereof. 
Such a method provides the amino acid sequence and/or x-ray diffraction data in a 
form which allows a skilled artisan to analyze and molecular model the three- 
dimensional structure of a BPI-related protein, including a subdomain thereof. 

30 In one application of this embodiment, BPI (or related protein such as LBP, 

CETP or PLTP), or at least one subdomain thereof, amino acid sequence and/or x-ray 
diffraction data of the present invention is recorded on computer readable medium. 
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As used herein, "computer readable medium" refers to any medium which can be 
read and accessed directly by a computer. Such media include, but are not limited 
to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as optical discs or CD-ROM; electrical 
5 storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how any of 
the presently known computer readable mediums can be used to create a manufacture 
comprising computer readable medium having recorded thereon an amino acid 
sequence and/or x-ray diffraction data of the present invention. 

10 As used herein, "recorded" refers to a process for storing information on 

computer readable medium. A skilled artisan can readily adopt any of the presently 
know methods for recording information on computer readable medium to generate 
manufactures comprising an amino acid sequience and/or atomic coordinate/x-ray 
diffraction data information of the present invention. 

15 A variety of data storage structures are available to a skilled artisan for 

creating a computer readable medium having recorded thereon an amino acid 
sequence and/or atomic coordinate/x-ray diffraction data of the present invention. 
The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs 

20 and formats can be used to store the sequence and x-ray data information of the 
present invention on computer readable medium. The sequence information can be 
represented in a word processing text file, formatted in commercially-available 
software such as WordPerfect and MICROSOFT Word, or represented in the form 
of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or 

25 the like. A skilled artisan can readily adapt any number of dataprocessor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium 
having recorded thereon the information of the present invention. 

By providing computer readable medium having stored thereon a BPI or 
related sequence protein and/or atomic coordinates based on x-ray diffraction data, 

30 a skilled artisan can routinely access the sequence and atomic coordinate or x-ray 
diffraction data to model a BPI or related protein, a subdomain thereof, mimetic, or 
a ligand thereof. Computer algorithms are publicly and commercially available which 
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allow a skilled artisan to access this data provided in a computer readable medium 
and analyze it for molecular modeling and/or RDD. See, e.g., Biotechnology 
Software Directory, Mary Ann Liebert Publ., New York (1995). 

The present invention further provides systems, particularly computer-based 

5 systems, which contain the sequence and/or diffraction data described herein. Such 
systems are designed to do structure determination and RDD for a BPI or related 
protein or at least one subdomain thereof. Non-limiting examples are microcomputer 
workstations available from Silicon Graphics Incorporated and Sun Microsystems 
running UNIX based, Windows NT or IBM OS/2 operating systems. 

10 As used herein, "a computer-based system" refers to the hardware means, 

software means, and data storage means used to analyze the sequence and/or x-ray 
diffraction data of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing unit 
(CPU), input means, output means, and data storage means. A skilled artisan can 

15 readily appreciate which of the currently available computer-based systems are 
suitable for use in the present invention. A visualization device, such as a monitor, 
is optionally provided to visualize structure data. 

As stated above, the computer-based systems of the present invention comprise 
a data storage means having stored therein a BPI or related protein or fragment 

20 sequence and/or atomic coordinate/x-ray diffraction data of the present invention and 
the necessary hardware means and software means for supporting and implementing 
an analysis means. As used herein, "data storage means" refers to memory which 
can store sequence or atomic coordinate/x-ray diffraction data of the present 
invention, or a memory access means which can access manufactures having recorded 

25 thereon the sequence or x-ray data of the present invention. 

As used herein, "search means" or "analysis means" refers to one or more 
programs which are implemented on the computer-based system to compare a target 
sequence or target structural motif with the sequence or x-ray data stored within the 
data storage means. Search means are used to identify fragments or regions of a BPI 

30 or related protein which match a particular target sequence or target motif. A variety 
of known algorithms are disclosed publicly and a variety of commercially available 
software for conducting search means are and can be used in the computer-based 
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systems of the present invention. A skilled artisan can readily recognize that any one 
of the available algorithms or implementing software packages for conducting 
computer analyses can be adapted for use in the present computer-based systems. 

As used herein, "a target structural motif," or "target motif/ refers to any 
5 rationally selected sequence or combination of sequences in which the sequence(s) are 
chosen based on a three-dimensional configuration or electron density map which is 
formed upon the folding of the target motif. There are a variety of target motifs 
known in the art. Protein target motifs include, but are not limited to, enzymic active 
sites, structural subdomains, epitopes, functional domains and signal sequences. A 

10 variety of structural formats for the input and output means can be used to input and 
output the information in the computer-based systems of the present invention. 

A variety of comparing means can be used to compare a target sequence or 
target motif with the data storage means to identify structural motifs or electron 
density maps derived in part from the atomic coordinate/x-ray diffraction data. A 

15 skilled artisan can readily recognize that any one of the publicly available computer 
modeling programs can be used as the search means for the computer-based systems 
of the present invention. 

One application of this embodiment is provided in Figure 6. Figure 6 
provides a block diagram of a computer system 102 that can be used to implement 

20 the present invention. The computer system 102 includes a processor 106 connected 
to a bus 104. Also connected to the bus 104 are a main memory 108 (preferably 
implemented as random access memory, RAM) and a variety of secondary storage 
memory 110, such as a hard drive 112, a removable medium storage device 1 14, a 
command device 118, and a visualization device, 120. Also included is a removable 

25 storage medium 116. The removable medium storage device 114 may represent, for 
example, a floppy disk drive, a CD-ROM drive, a magnetic tape drive, etc. A 
removable storage medium 116 (such as a floppy disk, a compact disk, a magnetic 
tape, etc.) containing control logic and/or data recorded therein may be inserted into 
the removable storage device 114. The computer system 102 includes appropriate 

30 software for reading the control logic and/or the data from the removable storage 
medium 116 once inserted in the removable medium storage device 114. 
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Amino acid, encoding nucleotide or other sequence and/or atomic 
coordinate/x-ray diffraction data of the present invention may be stored in a well 
known manner in the main memory 108, or any of the secondary storage devices 1 10, 
and/or a removable storage medium 116. Software for accessing and processing the 

5 amino acid sequence and/or atomic coordinate/x-ray diffraction data (such as search 
tools, comparing tools, etc.) reside in main memory 108 during execution. User 
commands are implemented through a command device 118, such as a keyboard. 
The visualization device 120 is optionally used to visualize the structure data. 
Structure Determination 

10 One or more computational steps, computer programs and/or computer 

algorithms are used to provide a molecular 3-D model of BPI or related protein, using 
amino acid sequence data from Figure 5 (or fragments or variants thereof) and/or 
atomic coordinate/x-ray diffraction data. In x-ray crystallography, x-ray diffraction 
data and phases are combined to produce electron density maps in which the three- 

15 dimensional structure of a BPI protein is then built or modeled. MIR Phase 
determination was used for solving the three-dimensional structure of BPI. This 
structure can then be used for RDD of mimetics or ligands of a BPI or related protein 
and its associated biological activity, which is relevant to a protein modulated disease. 
Density Modification and Map Interpretation 

20 Electron density maps were calculated by X-PLOR or alternatively using such 

programs as those from the CCP4 computing package (SERC (UK) Collaborative 
Computing Project 4, Daresbury Laboratory, UK, 1979). If non-crystallographic 
symmetry axes are present, cycles of symmetry averaging can further be used, such 
as with the program RAVE (Kleywegt & Jones, Bailey et aL, eds., First Map to 

25 Final Model, SERC Daresbury Laboratory, UK, pp 59-66 (1994)) and gradual model 
expansion. For map visualization and model building the program FRODO was used 
or alternatively, a program such as tt O" (Jones (1991), infra) can be used. 

Refinement and Model Validation 

Rigid body and positional refinement can be carried out using a program such 
30 as X-PLOR (Brunger (1992), infra), e.g., with the stereochemical parameters of Engh 
and Huber (Acta Cryst. .447:392-400 (1991)). If the model at this stage in the 
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averaged maps is still missing residues (e.g., at least 5-10 per subunit), some or all 
of the missing residues can be incorporated in the model during additional cycles of 
positional refinement and model building. The refinement procedure can start using 
data from lower resolution (e.g., 25-10A to 10-3.0 A and then be gradually extended 
5 to include higher resolution data from 12-6A to 3.0-1.5 A). B- values (also termed 
temperature factors) for individual atoms were refined once data between 2.9 and 1.5 
A has been added. Subsequently waters were gradually added by manual inspection 
of electron density maps. Alternatively, a program such as ARP (Lamzin and 
Wilson, Acta Cryst. D49: 129-147 (1993)) can be used to add crystallographic waters 

10 and as a tool to check for bad areas in the model. The programs PROCHECK 
(Lackowski ex ah, 7. AppL Cryst. 26:283-291 (1993)), WHATIF (Vriend, 7. MoL 
Graph. 5:52-56 (1990)), PROFILE 3D (Luthy et al, Nature 356:83-85 (1992)), and 
ERRAT (Colovos & Yeates Protein Science, 2:1511-19 (1993)) as well as the 
geometrical analysis generated by X-PLOR were used to check the structure for 

15 errors. Anisotropic scaling between F^ and was applied after careful assessment 
of the quality and completeness of the data. 

The program DSSP was used to assign the secondary structure elements 
(Kabsch and Sander, Biopolymers, 22:2577-2637 (1983)). A program such as 
SUPPOS (from the BIOMOL crystallographic computing package) can be used for 

20 some or all of the least-squares superpositions of various models and parts of models. 
The program ALIGN (Cohen J. MoL Biol, 190: 593-604 (1986)) was used to 
superimpose N- and C-terminal domains of BPI. Solvent accessible surfaces and 
electrostatic potentials can be calculated using such programs as GRASP (Nicholls et 
al. (1991), infra). 

25 The structure of BPI from different organisms and the related proteins LBP, 

CETP and PLTP can thus be solved with the molecular replacement procedure such 
as by using X-PLOR (Brunger (1992), infra). A partial search model for a portion 
or all of these proteins can be constructed using the structures of BPI. The rotation 
and translation function can be used to yield orientations and positions for these 

30 models. Symmetry averaging can also be done using the RAVE program and model 
expansion can also be used to add missing residues resulting in a model with 95- 
99.9% of the total number of residues. The model can be refined in a program such 
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as X-PLOR (Briinger (1992), supra), to a suitable crystallographic R faaor . The model 
data is then saved on computer readable medium for use in further analysis, such as 
rational drug design. 

Rational Design ofMimetics or Ligands 

5 The determination of the crystal structure of a BPI protein, as described 

herein, provides a basis for the design of new and specific agents, including proteins 
or organic compounds. 

Several approaches can be taken for the use of the crystal structure of a BPI 
in the rational design of protein or organic analogs having a relevant activity similar 

10 to that of a BPI or related protein. A computer-assisted, manual examination of a 
BPI potential binding site structure is optionally done. The use of software such as 
GRID — Goodford, /. Med. Chem. 28:849-857 (1985) a program that determines 
probable interaction sites between probes with various functional group characteristics 
and the protein surface — is used to analyze the surface sites to determine structures 

15 of similar inhibiting proteins or compounds. The GRID calculations, with suitable 
inhibiting groups on molecules (e.g., protonated primary amines) as the probe, are 
used to identify potential hotspots around accessible positions at suitable energy 
contour levels. 

A diagnostic or therapeutic BPI or related protein modulating ligand of the 
20 present invention can be, but is not limited to, at least one selected from a lipid, a 
nucleic acid, a compound, a protein, an element, an antibody, a saccharide, an 
isotope, a carbohydrate, an imaging agent, a lipoprotein, a glycoprotein, an enzyme, 
a detectable probe, and antibody or fragment thereof, or any combination thereof, 
which can be detectably labeled as for labeling antibodies. Such labels include, but 
25 are not limited to, enzymatic labels, radioisotope or radioactive compounds or 
elements, fluorescent compounds or metals, chemiluminescent compounds and 
bioluminescent compounds. Alternatively, any other known diagnostic or therapeutic 
agent can be used in a method of the invention. Suitable compounds are then tested 
for activities of a BPI protein or BPI mimetic. 
30 The program DOCK (Kuntz et al. J. Mol Biol, 767:269-288 (1982)) may be 

used to analyze an active site or ligand binding site and suggest ligands with 
complementary steric properties. Several methodologies for searching three- 
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dimensional databases to test pharmacophore hypotheses and select compounds for 
screening are available. These include the program CAVEAT (Bacon et al. /. Mol 
Biol, 225: 849-858 (1992)) which uses databases of cyclic compounds which can act 
as "spacers" to connect any number of chemical fragments already positioned in the 

5 active site. This allows one skilled in the art to quickly generate hundreds of possible 
ways to connect the fragments already known or suspected to be necessary for tight 
binding. The program LUDI (Bohm et al. J. Comput.-Aid. Mol. Des., 6:61-78 
(1992)) can determine a list of interactions sites into which to place both hydrogen 
bonding and hydrophobic fragments. LUDI then uses a library of approx. 600 linkers 

10 to connect up to four different interaction sites into fragments. Then smaller 
"bridging" groups such as -CH2- and -COO- are used to connect these fragments. 
For example, for the enzyme DHFR, the placements of key functional groups in the 
well-known inhibitor methotrexate were reproduced by LUDI. See also, Rotstein and 
Murcko, J. Med. Chem., 36:1700-1710 (1992)). 

15 After preliminary experiments are done to determine the K t of a ligand (e.g. , 

a lipid ligand) by BPI (or related) protein to a BPI (or related) protein, mimetic or 
fragment, the time-dependent nature of the inhibition by the BPI or related protein 
(e.g., by the method of Henderson (Biochem. J. 727:321-333 (1972)) is determined. 
For example, a lipid ligand and a BPI mimetic are pre-incubated in buffer. 

20 Reactions are initiated by the addition of detecting substrate. Aliquots are removed 
over a suitable time course and each quenched by addition into the aliquots of suitable 
quenching solution. The concentration of product are determined by known methods 
of detection. Plots of activity against time can be close to linear over the assay 
period, and are used to obtain values for the initial velocity in the presence (Vj) or 

25 absence (V D ) of, for example, a BPI mimetic. Error is present in both axes in a 
Henderson plot, making it inappropriate for standard regression analysis 
(Leatherbarrow, Trends Biochem. ScL 75:455-458 (1990)). Therefore, Kj values are 
obtained from the data by fitting to a modified version of the Henderson equation for 
competitive inhibition: 

30 Or 2 + (E t - Q - I t )r - E t = 0 
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where (using the notation of Henderson (Biochem. J. 727:321-333 (1972)): 



and 




5 

This equation is solved for the positive root with the constraint that Q = Kj(( A t + IQ 
/ KJ using PROCNUN from SAS (SAS Institute Inc., Cary, North Carolina, USA) 
which performs nonlinear regression using least-square techniques. The iterative 
method used is optionally the multivariate secant method, similar to the Gauss- 
10 Newton method, except that the derivatives in the Taylor series are estimated from 
the histogram of iterations rather than supplied analytically. A suitable convergence 
criterion is optionally used, e.g. , where there is a change in loss function of less than 
10- 8 . 

Once modulating compounds are found, crystallographic studies of co- 
15 complexes, for example, BPI mimetics complexed to a ligand are performed. As 
used herein, a co-complex refers to a BPI protein, fragment, analog or variant thereof 
in covalent or non-covalent association with a chemical entity or compound. As a 
non-limiting example, BPI crystals are soaked for 2 days in 0.01-100 mM inhibitor 
compound and x-ray diffraction data are collected on an area detector and/or an image 
20 plate detector (e.g. , a Mar image plate detector) using a rotating anode x-ray source. 
Data are collected to as high a resolution as possible, e.g., <3.0 A, and merged 
with a suitable /?- factor on intensities. An atomic model of the mimetic is built into 
the difference Fourier map (inhibitor complex -^nauve)- The model can be refined to 
convergence in a cycle of simulated annealing (Briinger (1987), infra) involving 10- 
25 100 cycles of energy refinement, 100-10,000 1-fs steps of room temperature dynamics 
and/or 10-100 more cycles of energy refinement. Harmonic restraints may be used 
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for the atom refinement, except for atoms within a 10-15 A radius of the inhibitor. 
An /{-factor is calculated for the model as well as an r.m.s. deviation from the ideal 
bond lengths and angles. 

Direct measurements of activity provide further confirmation that the modeled 
5 mimetic compounds are high-affinity inhibitors for the lipid ligands. Other suitable 
assays for biological activity known for BPI or related proteins may be used. 

Preferably, little or no change in the structure of the BPI or mimetic occurs 
in the electron density map described above. Ki values are determined by a 
previously described method (Henderson (1972), infra) to evaluate mimetic proteins 

10 or organic compounds. 

Atomic coordinates of BPI proteins are useful in the generation of molecular 
models of related proteins and of BPI mimetics. The atomic coordinates generated 
from the solved three-dimensional structure of BPI disclosed herein may be utilized 
in combination with additional structural and/or physicochemical information, such 

15 as amino acid sequence data, x-ray diffraction data, combinations of x-ray diffraction 
data from multiple isomorphous replacement molecular replacement techniques, or 
other phase determination techniques. These combinations may be used to generate 
other three-dimensional coordinate data useful to generate secondary, tertiary and/or 
quaternary structures and/or domains of BPI or related proteins, including BPI-related 

20 lipid binding proteins, or their fragments, analogs, or variants. These alternate 
coordinate sets are useful to provide overall three-dimensional structure, as well as 
binding and or active sites of a BPI or related protein, including BPI-related lipid 
binding proteins, or their fragments, analogs, or variants. These alternate coordinate 
data sets are also useful in molecular modeling computer-based systems and methods 

25 for rational drug design of mimetics and ligands of BPI and other related proteins, 
including other BPI-related lipid binding proteins. Utilizing CLUSTAL (a multiple 
sequence alignment program in PC-Gene) and the Homology module (a structure- 
based homology modeling program in Insightll on a Silicon Graphics Incorporated 
workstation, molecular models (and the corresponding three-dimensional coordinates 

30 files) of lipopoly saccharide binding protein (LBP), cholesteryl ester transfer protein 
(CETP) and phospholipid transfer protein (PLTP) are generated. With these files, 
existing mutants are mapped and new ones designed. 



WO 98/58961 



PCT/US98/13007 



43 

The results described herein demonstrate that tight-binding mimetics of a BPI 
or related protein, based on the crystal structure of BPI, are provided by the present 
invention. Demonstration of clinically relevant levels of a biological activity of the 
mimetic is also useful. 
5 In evaluating mimetics for biological activity in animal models (e.g. , mouse, 

rat, rabbit, baboon) various oral and parenteral routes of administration are evaluated. 
Using this approach, it is expected that a biological activity occurs in suitable animal 
models, e.g., using the mimetics discovered by structure determination and x-ray 
crystallography. 

10 Having now generally described the invention, the same will be more readily 

understood through reference to the following examples which are provided by way 
of illustration, and are not intended to be limiting of the present invention. 

Example 1: Preparation and Purification for Crystallization 
Construction ofPlasmids Containing BPI (S351A) 

15 BPI contains a single N-linked glycosylation site at the asparagine at position 

349 which was eliminated by genetic engineering of the DNA sequence of BPI as 
follows. For glycosylation to occur at this position, the asparagine must occur within 
the sequence Asn-X-Ser/Thr where* X can be any amino acid, except proline. 
N-linked glycosylation can be eliminated by either changing the Asn to another amino 

20 acid such as glutamine or by changing the serine or threonine to an alternate amino 
acid. The latter strategy was used to construct vectors containing BPI with an alanine 
at position 351 instead of serine. 
Construction of Plasmids for BPI Expression 

The plasmid pIC108 containing a cDNA encoding BPI cloned in a T3T7 

25 plasmid (Clontech, Palo Alto, CA) served as the starting point for the construction 
of a vector for expression of nonglycosylated rBPI in mammalian cells. 

To allow insertion of BPI into an optimized mammalian expression vector, a 
unique Xhol site was first added to the 3' end of the BPI gene in pIC108. Two 
oligonucleotides were synthesized for this purpose: BPI-53 (5 ' ACT GGT TCC ATG 

30 GAG GTC AGC GCC 3') encoding amino acids 361 - 370 of BPI and BPI-54 (5' 
GAC AGA TCT CTC GAG TCA TTT ATA GAC AA 3') encoding the last four 
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amino acids of coding sequence, the stop codon (TGA), and incorporating an Xhol 
site immediately downstream of the stop codon. These oligonucleotides were used 
to PGR amplify a 280 bp fragment of the C-terminus of BPI and incorporate the Xhol 
site at the 3 ' end of the gene. The amplified fragment was digested with Ncol and 
5 Bgttl and ligated to a -4100 bp NcoI-BamHI fragment from pIC108 to generate the 
plasmid pSSlOl. 

Construction of Plasmid with BPI (S3S1A) 

\ The glycosylation site was next removed by replacing the region from a unique 
Xcml site to a unique Sphl site within the BPI gene in pSSlOl with an annealed 
10 oligonucleotide that contained the codon (TCC) for the serine at amino acid position 
351 changed to the codon (GCC) for alanine as shown below. 

Wild type 

Xcml Sphl 
. . . CCC AAC TCC TCC CTG GCT TCC CTC TTC CTG ATT GGC ATG CAC 
15 . . .GGG TTC AGG AGG GAC CGA AGG GAG AAG GAC TAA CCG TAC GTG 
Pro Asn Ser Ser Leu Ala Ser Leu Phe Leu lie Gly Met His 
351 

Nonglycosylated 

Xcml Sphl 
20 ... CCC AAC TCC GCC CTG GCT TCC CTC TTC CTG ATT GGC ATG CAC 
. . . GGG TTC AGG CGG GAC CGA AGG GAG AAG GAC TAA CCG TAC GTG 
Pro Asn Ser Ala Leu Ala Ser Leu Phe Leu He Gly Met His 
351 

This step generated the plasmid pSS102. 

25 To construct the vector, pING4322 t for the expression of full length 

nonglycosylated holo BPI in mammalian cells, pSS102 was digested with BstBl and 
Xhol and a 596 bp fragment, which included the modified BPI sequence, was purified 
and ligated to the large BsiBl-Xhol fragment from pING4147 which contains the gpt 
gene encoding resistance to mycophenolic acid, the human Ig enhancer, the human 

30 cytomegalovirus promoter (CMV) and the mouse light chain 3' untranslated region 
and is identical to the vector, pING4144 as described in U.S. Patent No. 5,420,019 
and W094/18323 (PCT/US94/01235) hereby incorporated by reference, except that 
it contains the codon for the native cysteine instead of an alanine at amino acid 
position 132 of the BPI gene. 

35 Stable Transfection of Mammalian Cells for Expression of Nonglycosylated BPI 
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Mammalian cells are preferred hosts for production of rBPI protein analogs 
as described herein. Such cells permit proper secretion, folding, and 
post-translational modification of expressed proteins. Presently preferred mammalian 
host cells for production of BPI proteins include cells of fibroblast origin, such as 
5 CHO-K1 cells (ATCC CCL61), CHODG44 cells (a dihydrofolate reductase [DHFR] 
minus mutant of CHO Toronto obtained from Dr. Lawrence Chasin, Columbia 
University), CHO-DXB-11 (a DHFR" mutant of CHO-K1 obtained from Dr. 
Lawrence Chasin), Vero Cells (ATCC CRL81) and Baby Hamster Kidney (BHK) 
cells (ATCC CRL6281) and cells of lymphoid origin, such as the hybridoma 
10 Sp2/0-Agl4 (ATCC CRL1581) or the myeloma, NSO (ECACC No. 85110503). 

Transfection of mammalian cells can be accomplished by a variety of methods. 
Two of the most common approaches involve calcium phosphate precipitation of the 
expression vector DNA which is subsequently taken by the cells and electroporation, 
which causes the cells to take up the DNA through membrane pores created by the 
15 generation of a strong electric field. Selection for transfected cells is facilitated by 
the incorporation in the expression vector of a gene whose product allows the 
transfected cells to survive and grow under selective conditions. A number of such 
genes have been identified including, among others, the bacterial Tn5 neo gene, 
which encodes resistance to the antibiotic G418 and the Escherichia coli guanine 
20 phosphoribosyl transferase (gpt) gene, which encodes resistance to mycophenolic acid 
(MP A) in the presence of xanthine (Mulligan and Berg, Proc. NatL Acad. Sci. 
78:2072-2076 (1981)), the dihydrofolate reductase (DHFR) gene, which allows for 
growth of DHFR* cells in the absence of nucleosides and gene amplification in the 
presence of increasing concentrations of methotrexate, the glutamine synthetase gene, 
25 which allows for growth of glutamine auxotrophs without glutamine and gene 
amplification in the presence of methionine sulfoximine and the Salmonella 
typhimurium hisD gene and the E. coli trpB gene (Hartman and Mulligan, Proc. NatL 
Acad. ScL 85: 8047-8051 (1988)), which allow growth in the presence of histidinol 
or without tryptophan (in the presence of indole), respectively. The availability of 
30 these selective markers provide significant flexibility for the generation of mammalian 
cell lines that express recombinant products, since they can be used either alone or 
in various combinations to provide cell lines with the highest possible productivity. 
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Transfection of CHO-K1 Cells with pING4322 

The CHO-K1 cell line was maintained in Ham's F12 medium plus 10% fetal 
bovine serum (FBS). Media were supplemented with glutamine/penicillin/strepto- 
mycin (Irvine Scientific, Irvine, California). 
5 CHO-K1 cells were transfected by electroporation with 40 fig of pING4322 

DNA that was first digested with Pvul, phenol-chloroform extracted and ethanol 
precipitated. Following the electroporation, the cells were allowed to recover for 24 
hours in non-selective Ham's F12 medium. The cells were then trypsinized, 
resuspended at a concentration of -5 X 10 4 cells/ml in Ham's F12 medium 

10 supplemented with MPA (25 /ig/mL) and xanthine (250 ptg/mL) and plated at ~ 10 4 
cells/well in 96 well plates. Untransfected CHO-K1 cells are unable to grow in this 
medium due to the inhibition of pyrimidine synthesis by the MPA. At -2 weeks, 
colonies consisting of transfected cells were observed in the 96 well plates. 
Supernatants from wells containing single colonies were analyzed for the presence of 

15 BPI-reactive protein by anti-BPI ELISA using BPI23 as a standard. In this assay, 
ImmuIon-II 96 well plates (Dynatech) were pre-coated with affinity purified rabbit 
anti-BPI^, antiserum, followed by supernatant samples and detection was with affinity 
purified, biotinylated rabbit anti-BPI^ antiserum followed by peroxidase-labeled 
avidin. A total of 100 colonies were Screened in this manner. The top isolates were 

20 transferred to 24 well plates and productivity was assessed as follows. Cells were 
grown to confluence in a 24 well plate in Ham's F12 medium supplemented with 10% 
FBS. Once the cells reached confluence, the Ham's F12 medium was removed and 
2 ml of HB-CHO serum free medium (Irvine Scientific) plus 40 /iL of S-Sepharose 
beads (Pharmacia) were added. The cells were incubated for 7 days after which the 

25 S-Sepharose beads were removed and washed with 0. 1 M NaCl in 10 mM Tris buffer 
(pH 7.5). BPI was eluted from the beads by addition of 1.0 M NaCl in Tris buffer. 
The top producers, designated Clones 37 and 91, secreted —17 and 14 jig/ml, 
respectively in this assay and were frozen as Research Cell Bank numbers C2020 and 
C2021, respectively. Purified protein was prepared for crystallization studies as 

30 follows. 

Production and Purification of Nonglycosylated rBPL 



WO 98/58961 



PCT/US98/13007 ' 



47 

The host cells used to prepare protein for crystallization studies were CHO-K1 
cells transformed with the DN A vector pING4322 which includes DNA encoding the 
456 amino acids of human BPI preceded by its endogenous 31 residue secretory 
signal as described above. During post-translational secretory processing, the signal 

5 sequence residues were removed by the host cell. The desired expression product, 
nonglycosylated rBPI, was a biologically active variant of the human BPI molecule 
in which the amino acid serine at position 351 in the human BPI protein has been 
replaced with an alanine. 

Forty roller bottles were prepared which contained the transfected CHO-K1 host 

10 cells at 1.3 x 10 7 cells per bottle in DME/F12 media supplemented with 5% fetal 
bovine serum (FBS). The cells were grown for three days, at which time 500 ml of 
fresh media, DME/F12 with 2.5% FBS was added along with a 10 ml slurry 
(approximately 8 gr.) of sterilized S-Sepharose (Pharmacia, fast flow #17-0511-01, 
Uppsula, Sweden) and 1 ml of a 1 M solution of sodium butyrate. After two days, 

15 the old media plus the S-Sepharose was removed and fresh media, S-Sepharose and 
sodium butyrate were added to each roller bottle. This process of harvesting the 
expressed protein product with S-Sepharose was repeated for a total of three harvests, 
and the S-Sepharose removed during each harvest was pooled. The use of S- 
Sepharose beads to capture recombinant BPI protein products has been described in 

20 U.S. Patent No. 5,439,807 and WO93/23540 (PCT/US93/04752). 

The expressed nonglycosylated rBPI protein was purified from the pooled S- 
Sepharose by first removing it from the S-Sepharose resin followed by further 
purification and concentration on a series of Q-Sepharose (Pharmacia, fast flow #17- 
0510-01) and CM-Spherodex (Sepracor, #273431, Villeneuve la Garenne, France) 

25 columns. Following purification, the protein was buffer exchanged utilizing a 
Sephacryl S-100 (Pharmacia, high resolution #17-0612-01) column. 

Specifically, the pooled S-Sepharose resin from each harvest was allowed to 
settle for approximately 15 minutes. The media was removed by decanting and the 
settled resin was washed three times with approximately 400 ml of 20 mM MES, pH 

30 6.8, 150 mM NaCl. For each wash, the buffer was added, the mixture was stirred 
gently and the resin was allowed to resettle for approximately 15 minutes. Each 
buffer wash was removed by decanting. The beads were then washed with 400 mL 
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of 20 mM sodium acetate/acetic acid, 150 mM NaCl, pH 4.0 (acetate buffer), and 
then poured into a 2.5 x 50 cm liquid chromatography column (BIORAD, 
Econocolumn, Richmond, California). The column was washed extensively with 
approximately 2 liters of 400 mM NaCl-acetate buffer, pH 4.0 until the A280 
5 absorbance reading of the column eluate was equal to that of the buffer alone. The 
column was additionally washed with approximately 600 ml of 600 mM NaCl-acetate 
buffer until the A280 absorbance of the eluate was again equal to that of buffer alone. 
^ The protein was then eluted from the S-Sepharose in approximately 500 ml of 1.0 M 
NaCl-acetate buffer. 

10 The S-Sepharose eluates from each harvest were pooled and diluted to a NaCl 

concentration of 300 mM. The diluted material was then loaded on to a two column, 
serial arrangement of a 100 ml Q-Sepharose column connected to a 12 ml CM- 
Spherodex column. Both columns were constructed using new, sterile resin and were 
pre-equilibrated with 20 mM MES, pH 5.5, 200 mM NaCl. The Q-Sepharose 

15 column served to remove any nucleic acid in the sample material, and the protein did 
not bind to this resin. After the approximate 3 liters of protein containing material 
had been loaded, the Q-Sepharose column was disconnected and the CM-Spherodex 
column was washed with buffer until the A280 absorbance of the eluate was the same 
as buffer alone. The protein was eluted from the column in 20 mM MES, 400 mM 

20 NaCl, pH 5.5 in a volume of approximately 180 ml. This eluted fraction was then 
reloaded on to a smaller (2 ml) CM-Spherodex column for protein concentration, and 
the bound protein removed in a single step of 20 mM MES, 1.2 M NaCl, pH 5.5 in 
a volume of approximately 12 mL. The protein was then loaded directly on to a 150 
ml pyrogen-free Sephacryl S-100 column pre-equilibrated with 20 mM sodium citrate, 

25 150 mM NaCl, pH 5.0 buffer. Column fractions were analyzed by Coommassie- 
stained (0.5% Coommassie Brilliant Blue-R, 25% isopropanol, 10% methanol, 10% 
acetic acid) SDS-PAGE and Western analysis. Western analysis was performed 
using a 1:1000 dilution of a rabbit anti-human BPI antisera. Fractions which 
contained the nonglycosylated rBPl protein were pooled and resulted in a final lot 

30 which was greater than 95% pure as analyzed by Coommassie-stained SDS-PAGE. 

The protein samples thus prepared and purified were filtered and/or 
concentrated for crystallization studies of the purified nonglycosylated rBPI protein. 
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Protein samples were optionally filtered using a 0.2 /im syringe filter (Millipore 
Corp., Bedford, MA) or a 0.2 /xm Nalgene filter (Nalge Corp., Rochester, NY) to 
remove precipitate. Protein samples were concentrated in a Centricon 10 (Amicon 
Corp., Beverly, MA) or a Centriprep 10 (Amicon Corp., Beverly, MA). For the 

5 Centricon 10 concentrators, a JA 20 rotor (Beckman, Fullerton, CA) in a J2-21 
Beckman centrifuge was used at 6000 rpm for 60 minutes. For the Centriprep 10 
concentrators, a swinging bucket rotor in a J-6B Beckman centrifuge was used at 
3000 rpm for 60 minutes. Final volumes for various protein samples prepared for 
crystallization studies described herein ranged from about 0.1 to 1 mL, and the 

10 protein concentrations were generally between about 10 and 20 mg/mL. Protein 
solutions may be diluted or concentrated for crystallization studies, for example, from 
about 5 to about 50 mg/ml. 

Example 2: Structure Determination of a Crystallized BPI 
Protein 

15 Presented herein is the crystal structure of BPI and two bound phospholipids 

at 2.4 A resolution. Our model provides the first structural information on the LPS- 
binding and lipid transport protein family and suggests a common mode of lipid 
binding for its members. 

Purified, full-length, non-glycosylated, recombinant human BPI expressed in 

20 CHO cells was crystallized by hanging-drop vapor diffusion at room temperature. 
The protein concentration was 8.5 mg/ml and the crystallization buffer contained 12% 
(w/v) PEG 8000, 200 mM magnesium acetate, and 100 mM sodium cacodylate, pH 
6.8. Two crystal forms with slightly different cell dimensions grew under the same 
conditions in space group C 2 , with one molecule per asymmetric unit. Form 1 

25 crystals were reproducible and had cell dimensions of a= 185.0, fc=37.2, c=84.3 A, 
and /3=101.3*. Form 2 crystals appeared rarely and had cell dimensions of 
a= 185.6, &=33.0, c=85.2 A, and 0 = 101.6* 

For Table 1, x-ray diffraction data were collected at room temperature with 
the R-AXIS IIC imaging plate area detector mounted on a Rigaku RU200 rotating 
30 anode x-ray generator. Data were processed with DENZO and SCALEPACK 
[Z. Otwinski, in Proceedings of CCP4 Study Weekend: Data Collection and 
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Processing, L. Sawyer, N. Isaacs, S. Baileys, Eds, (SERC Daresbury Laboratory, 
Warrington, UK, 1993), pp. 56], For form 1 crystals, a native data set to 2.8 A was 
collected from a single crystal, which was 92.4% complete overall (84.9% complete 
with an average I/a(I)=2.3 in the outermost resolution shell). A native data set to 
5 2.4 A for form 2 was collected from two crystals and was 92.7% complete overall 
(94.6% complete with an average I/a(I)=2.6 in the outermost shell). Because they 
could be reliably reproduced, form 1 crystals were used for all heavy atom soaks. 
The structure was solved by multiple isomorphous replacement (MIR) with anomalous 
scattering. Heavy atom sites were identified by difference-Patterson and difference- 

10 Fourier maps. Phase refinement was performed with [Collaborative Computational 
Project No. 4, Acta Crystallogr. , D50: 760 (1994)] producing a mean figure of merit 
(FOM) of 0.57. The MIR map (Fig. 3) was improved by density modification 
including solvent flattening, histogram matching, and phase extension using DM 
[Collaborative Computational Project No. 4, supra]. After a partial model was 

15 obtained with FRODO [T. A. Jones, /. AppL Crystallogr., 11:26% (1978)], phase 
combination was performed with SIGMAA [Collaborative Computational Project No. 
4, supra] (final FOM =0.89). CMNP is chloro-Hg-nitrophenol; DMM is dimethyl 
mercury; PCMBS is parachloromercury-benzene sulfonate; TELA is triethyl lead 
acetate. 
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5 Table 2 relates to model refinement and statistics. The model was refined at 

2.8 A through iterative cycles of simulated-annealing with X-PLOR {A. T. Brunger 
and A. Krukowski, Acta Crystallogr., ,446:585 (1990)] and manual rebuilding. 10% 
of the data were set aside before refinement began for R frec [A. T. Brunger Nature, 
355:472 (1992)] calculations. When the model had been refined to an R-factor* of 

10 20.4% (Rf r ec =: 32.6%) with the 2.8 A data, rigid-body minimization was performed 
against the 2.4 A data set (R=29.8% to 3.5 A after minimization). Additional cycles 
of simulated annealing, positional refinement, correlated individual temperature factor 
refinement, and manual rebuilding reduced the R-factor to 22.7% and R frec =31.3% 
(no intensity cutoff). An overall anisotropic temperature factor and bulk solvent 

15 correction were applied to the observed reflections when showed improvement. 
The model was confirmed by calculating simulated-annealing omit-maps for every 
part of the structure. The final model contains all 456 residues of the protein, 48 
well-ordered waters, and 2 molecules of phosphatidylcholine. Regions of the 
backbone with poor electron density include residues 148, 232-236, 258-260, and 

20 parts of the loop between residues 281-311. Sidechains with poorly defined density 
were truncated to alanine. The model was examined by the programs PROCHECK 
[R. A. Laskowski, M. W. McArthur, D. S. Moss, J. M. Thornton, /. AppL 
Crystallogr. 26:283 (1993)], VERIFY [R. Liithy, J. U. Bowie, D. Eisenberg, Nature, 
356:83 (1992)], and ERRAT [C. Colovos and T. Yeates, Protein ScL, 2:1511 

25 (1993)]. 
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TABLE 2: REFINEMENT STATISTICS FOR FORM 2 CRYSTALS 

Data 





Resolution (A) 


2.4 




Unique reflections (N) 


18,898 


5 


ComDleteness (%} 


92 7 




Atoms in model 






PrntPiTi ^noti-hvdropen^ 








in? 




waier 




in 


ivci lnciueiii parujiicierii 






Kesoiuiion range v/v 


^n n o yi 






LL. / 






31.3 




Avg. atomic B factors 




15 


Protein 


36.9 




Lipid N, C 


49.4, 51. Of 




Waters 


44.6 




rms$ deviation from ideality 






Bonds (A) 


.0.006 


20 


Angles (deg) 


1.4 




Dihedrals (deg) 


26.0 




Impropers (deg) 


1.2 



*R - 100 (Z | F^- F e F^) where F^ and F e are the observed and calculated structure factors, respectively. 
|B factors for the lipids bound in the NH 2 - and COOH-termina! domains. 
25 Jrms=TOot mean square 



BPI is a boomerang-shaped molecule with approximate dimensions of 135 by 
35 by 35 A (Fig. 1 , A and B). It consists of two domains of similar size (NH 2 - and 
COOH-terminal) that are connected by a proline-rich linker of 21 residues (positions 
230 to 250). The two domains form three structural units; barrels are found at each 

30 end of the protein, and a central /? sheet forms an interface between the barrels. The 
secondary structure and topology of the two domains are similar, giving the protein 
pseudo-twofold symmetry. 

Each barrel (residues 10 to 193 and 260 to 421) contains three common 
structural elements: a short a helix, a five-stranded antiparallel jS sheet, and a long 

35 helix (Fig. 2A), in that order. We call these elements helix A, sheet N and helix B 
in the NH 2 -terminal domain, and helix A', sheet C and helix B' in the COOH- 
terminal domain. Sheets N and C have a series of jS bulges that change the direction 
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of their strands and cause a pronounced curve in the sheets. In each domain, the long 
helix lies along the concave face of the sheet, with the helical axis at -60" to the 
strands of the (5 sheet. A single disulfide bond between Cys 135 and Cys 175 anchors 
helix B to the final strand of sheet N. Situated between the NH 2 - and COOH-terminal 
5 barrels is a twisted, seven-stranded antiparallel (3 sheet composed of four strands from 
the NH 2 -terminaI domain and three strands from the COOH-terminal domain. This 
central sheet forms an interface between the two domains and is thus reminiscent of 
several dimer interfaces stabilized by hydrogen bonds between strands of a 0 sheet 
[M. Leeson, B. Henderson, J. Gillig, J. Schwab and J. Smith, Structure, 4:253 

10 (1996); D. Ohlendorf, W. F. Anderson, M. Lewis, C. O. Pabo, B. W. Matthews, 
/. Mol Biol, 169:151 (1983); G. N. Reeke, J. W. Becker, G. M. Edelman, /. Biol 
Chem., 250:1525 (1975)]. 

The structural similarity of the two domains of BPI is shown by the 
superposition [G. H. Cohen, J. Mol BioL, 190:593 (1986)] in Fig. 2B; they are 

15 related by a rotation of 173* and have a root mean square deviation (rmsd) of 3.0 A 
on the basis of superposition of 169 Cot pairs. The structure shared by these two 
domains does not resemble other protein folds; several structural alignment programs 
[N. N. Alexandrov and D. Fischer, Proteins, 25:354 (1996); D. Fischer, C. J. Tsai, 
R. Nussinov, Protein Eng., 5:981 (1995); L. Holm, C. Sander, Nucl Acids Res., 

20 22:3600 (1996)] failed to reveal a significant match to any known folds. Significant 
differences between the superimposed domains are found in two loop regions 
containing residues 45 and 96 in the NH 2 -terminal domain and residues 280 and 348 
in the COOH-terminal domain. These differences may be functionally important 
because the loops around residues 45 and 96 in the NH 2 -terminal domain have been 

25 implicated in LPS binding and bactericidal activity (see below). This structural 
similarity of the two domains was unexpected, not only because of their lack of 
significant sequence identity ( < 20%), but also because of their functional differences. 
The NH 2 -terminal domain of BPI is cationic and retains the bactericidal, LPS-binding, 
and LPS-neutralization activities of the intact protein [A. H. Horwitz, et al., Protein 

30 Expr. and Purif., 8:2$ (1996); C. E. Ooi, J. Weiss, P. Elsbach, B. Frangione and 
B. Mannion, 7. Biol Chem., 262:14891-14894 (1987); C. E. Ooi, J. Weiss, M. E. 
Doerfler and P. Elsbach, 7. Exp. Med., 174:649 (1991)]. The COOH-terminal 
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domain is essentially neutral and shows limited LPS-neutralization activity [SVL. 
Abrahamson et al., J. Biol Chem., 272:2149 (1997)]. However, the structural 
similarity of the two domains may reflect a previously undetected functional 
similarity: each domain contains a binding pocket for a phospholipid. 
5 After the amino acid sequence had been traced in the electron density maps, 

two regions of extended electron density remained that could not be accounted for by 
protein atoms. This density, found in the interior of both domains, was present in 
the multiple isomorphous replacement (MIR) maps (Fig. 3) at an intensity similar to 
that of the protein density, and it became the predominant feature in F^-F^ maps 
10 after sequence fitting (both form 1 and form 2 crystals). Electrospray mass 
spectrometry of the sample used for crystallization revealed two molecules, with 
relative molecular masses of 522 and 787, in approximately equal amounts. Tandem 
mass spectrometric analysis was consistent with the two species being 
phosphoglycerides containing a phosphatidylcholine head group and either one or two 
15 18-carbon acyl chains with one double bond. Phosphatidylcholine (Fig. 4 A) is 
abundant in eukaryotic cells and is presumably bound by BP1 in the cells from which 
the protein is isolated. 

The two lipids are bound in extensive apolar pockets on the concave surface 
of the boomerang, situated between the NH 2 -terminal and COOH-terminal barrels and 
20 the central j8 sheet. In the NH 2 -terminal domain, the entrance to the pocket is formed 
by helices A and B. The back and sides are formed by sheet N and the central sheet. 
The two acyl chains insert - 15 A into the interior of the protein and are surrounded 
by apolar side chains (Fig. 4B). The head group lies at the entrance of the pocket 
and is exposed to solvent. The pocket in the COOH-terminal domain, which has a 
25 slightly larger opening, is formed by the analogous secondary structures. Both basic 
and acidic side chains found near the entrances of the pockets are available for 
electrostatic interactions with the zwitterionic head group. When the lipids are 
removed from the model, the pocket in NH 2 - terminal domain has a solvent accessible 
surface area [M. L. Connolly, Science, 227:709 (1983); M. L. Connolly, J. Am. 
30 Chem. Soc, 707:1118 (1985)] of 557 A\ and the pocket in the COOH-terminal 
domain has an area of 413 A 2 , for a total of 970 A 2 . The intensity of the electron 
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density for the two acyl chains in both pockets is similar and does not indicate 
whether the single acyl chain species is found predominantly in either pocket. 

The discovery of bound phospholipid in our structure suggests a possible site 
of interaction between BPI and LPS. As seen in Fig. 4A, phosphatidylcholine and 
5 LPS share some structural similarity, including negatively charged phosphate groups 
and, most notably, acyl chains. Since BPI's function is to bind a lipid, LPS, and 
since lipid is bound in pockets of BPI, it seems reasonable that the acyl chains of LPS 
bind in the apolar pockets. The following observations support his hypothesis: i) the 
acyl chains of lipid A are known to be essential for binding by BPI [H . Gazzano- 

10 \ Santoro et ah, Infection and Immunity, 65:2201 (1995)]; ii) the binding pockets of 
BPI are reminiscent of cavities in other lipid-binding proteins [L. Banaszak et al. f 
Adv. Protein Chem., 45:89 (1994)]; and iii) BPI has a significant sequence similarity 
to two lipid transfer proteins (see below). 

Our proposed site of interaction between BPI and the acyl chains of LPS 

15 differs from that suggested by previous work focusing on the NH 2 -terminal domain. 
Fragments containing the NH 2 -terminal domain of BPI have been identified with 
equivalent or greater bactericidal and LPS-binding activities relative to the full-length 
protein [A. H. Horwitz, et al., Protein Expression and Purification, 5:28 (1996); C. 
E. Ooi, J. Weiss, ML E. Doerfler and P. Elsbach, J. Exp. Med., 174:649 (1991)]. 

20 The activity of one NH 2 -terminal fragment was reduced when residues past positions 
12 or between positions 169 and 199 were deleted [C. Capodici and J. Weiss, 7. 
Immunol., 756:4789 (1996)]. The structure shows that these deletions affect elements 
of the barrel (at the beginning of helix A and from the middle to the end of helix B) 
and could significantly alter its structure. While the barrel seems to be the minimal 

25 structural unit with full activity, three smaller regions of this domain retain significant 
LPS-binding, LPS-neutralization, and bactericidal activity [R. G. Little, D. N. 
Kelner, E. Lim, D. J. Burke and P. J. Conlon, 7. Biol Chem., 265:1865 (1994)]: 
residues 17-45 (most of helix A and the first j3 strand of sheet N), residues 82-108 
(a j3 hairpin [residues 82 to 106 of BPI show limited sequence similarity with residues 

30 32 to 51 of the limulus anti-LPS factor (LALF), and have been predicted to form an 
amphipathic jS hairpin similar to that seen in the LALF structure [A. Hoess, S. 
Watson, G. R. Silber and R. Liddington, EMBO 72:3351 (1993)]. Although this 
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region of BPI does form a j3 hairpin, the strict amphipathic character of the loop seen 
in LALF is not maintained, and a structural superposition shows that the sequence of 
BPI must be shifted by one residue relative to the proposed sequence alignment] 
between strands 3 and 4 of sheet N), and residues 142-169 (a segment preceding and 

5 including part of helix B). These three regions include 18 basic residues (and only 
4 acidic residues) and form a positively charged tip on the NH 2 -teiminal domain (on 
the left of Fig. 1) which may make favorable electrostatic interactions with negatively 
charged groups of LPS. Further studies are necessary to determine the relative 
importance of the apolar pockets and positively charged NH 2 -terminal tip to BPI's 

10 LPS-binding and bactericidal activities. 

BPI is the first member of the mammalian LPS-binding and lipid transfer 
family to have its three-dimensional structure determined. BPI and LBP are related 
to two lipid transfer proteins, cholesteryl ester transfer protein (CETP) and 
phospholipid transfer protein (PLTP) [A. Tall, Anna. Rev. Biochem., 64:235 (1995)]. 

15 Alignment of the amino acid sequences of human BPI, LBP, CETP, and PLTP with 
BPFs secondary structure (Fig. 5, A and B) shows that structurally important residues 
are conserved in the four proteins. The two cysteines that form the single disulfide 
bond and are critical to the function of BPI [A. H. Horwitz, et al., Protein Expression 
and Purification, 5:28 (1996)] are completely conserved. Also, the pattern of 

20 hydrophobic/hydrophilic residues in the 0 strands indicates that the jS bulges 
responsible for the extensive sheet twisting are preserved. The conserved sequences 
strongly suggest that members of the LPS-binding and lipid transfer family share 
BPI's two-domain structure and that the two domains are similar in topology. 

It is likely that the lipid transfer proteins will also share the apolar binding 

25 pockets found in BPI. Striking parallels are found between our BPI- 
phosphatidylcholine structure and previous work showing that CETP copurifies with 
an equimolar amount of phosphatidylcholine [A. Tall, Annu. Rev. Biochem., 64:235 
(1995)] and has two distinct binding sites [S. Wang, L. Deng, R. W. Milne and A. 
R. Tall, J. Biol. Chem., 267:17487 (1992)] one for neutral lipids and another for 

30 phospholipids. The known ligands of CETP and PLTP (cholesteryl esters, 
triglycerides, retinyl esters and phospholipids) all contain at least one acyl chain 
which could bind in apolar pockets similar to those in BPI, suggesting a common 
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mode of ligand binding in this family. Sequestration of these hydrophobic chains in 
interior pockets may be critical to the function of the lipid transfer proteins: transfer 
of apolar ligands in an aqueous environment. Thus, the structure of BPI illuminates 
the action of the plasma lipid transfer proteins, as well as offering possibilities for 
5 how BPI and LBP interact with LPS. 

Example 3: Molecular Modeling of BPI Ligands and Mimetics 
We have used the information derived from the X-ray crystal structure of BPI 
presented herein, along with the teachings of the art, including, for example, 
^ " WO94/20532 (PCT/US94/02465) to design various BPI-related proteins and peptides. 
10 These constructs may be divided into categories as illustrated below, including 
peptides and proteins, including fragments, analogs and variants of the protein, since 
they best describe the different ways in which different domains and portions may be 
assembled to achieve new molecules. 

1 . Individual Peptide Domains: The overlapping BPI peptide data indicated that 

15 the N-terminal domain of BPI contains at least three independent functional domains 
that have one or more of the biological activities of BPI, including, for example, 
antibacterial, antifungal, anti-heparin and anti-angiogenic activities. Domain I is a 
region of amino acid residues from about 17 to about 45; Domain II is a region of 
amino acid residues from about 65 to about 99; and Domain III is a region of amino 

20 acid residues from about 142 to about 169. Hundreds of peptide sequences derived 
from these domains have been synthesized, including addition, deletion and 
substitution variants of the domain-derived sequences. Through further refinements, 
smaller "core" regions within these domains have been identified that still retain high 
levels of activity; for example, within Domain II residues 90-99 and within Domain 

25 III residues 148-161. 

These peptides have included linear molecules that may or may not assume a 
conformation that maximally express activity. From the X-ray structure data, 
segments of BPI are designed that should preserve the three-dimensional structure of 
these domains when constructed outside the context of the intact protein. For 

30 example, both Domains I and II contain hairpin loop structures that are positioned 
adjacent to one another in space on the proximal tip of the molecule. In contrast, 
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although Domain III is a helix + turn and not a loop, extensions from both ends of 
the domain are positioned near enough to each other to consider linking them 
together. As a result, peptides can be designed that reflect these structures by 
replacing selected residues in synthetic or recombinant peptides with cysteines, so as 
5 to create disulfide-stabiized domain mimetics. Since this approach is based upon the 
actual structure of BPI, it differs from those of other groups that are based upon 
putative alignments between BPI and the structure of proteins such as the Limulus 
amoebocyte lysate factor (LALF). As examples of these embodiments, listed below 
are a series of exemplary peptides that, with cysteines added to the N- and C-terminus 
10 could assume structures similar to those seen in the intact protein: from Domain I: 
residues 36-54; from Domain II: residues 84-109, 85-108, 86-107, 87-106, 88-105, 

89- 104, or 90-103; and from Domain III: residues 142-164. 

2. Peptide Domain Hybrids: It has also been demonstrated that certain peptide 
domain hybrids, which include repeats of the same sequence from a single domain 

15 or inter-domain combination of sequences, have enhanced activities. For example, 
linking Domain II- and Ill-derived peptides (such as XMP.29: 85-99:: 148-161) has 
enhanced biological activity. Interestingly, these domains in the crystal structure are 
closely associated in space, and peptide XMP.29 may represent a Domain II: :II1 
hybrid that actually shares some structural similarity to the intact protein. Based upon 

20 the X-ray structure, a Domain II-II1 peptide that consists of approximately residues 

90- 103:: 146-162 is constructed. Such a peptide may even more closely mimic what 
is seen in the protein. 

3. BPI "Tip" Mimetics: As discussed above, portions of all three peptide 
domains discovered and described in WO94/20532 come together on the proximal tip 

25 of the N-terminal fragment. As a result, a BPI "tip" mimetic is designed that 
essentially "slices" off the most extreme tip of the molecule but preserves the critical 
domain elements. Such a slice would lack the hydrophobic pockets found in the 
intact protein, but may exhibit activities beyond those of the individual segments. 
The following segments represent such a "slice" of the three peptide domains. 

30 However, to best preserve the geometry between the domains, it would be desirable 
to insert "linker" sequences between them so as to ensure proper positioning. By 
fixing these segments in space, programs such as Insightll (Molecular Simulations, 
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Inc.) can identify possible linker sequences by i) searching protein databases for 
similar structures or ii) de novo designing appropriate linkers. In this regard it may 
be desirable to utilize residues that are not readily susceptible to proteolysis (Ala, Ser, 
Gly, etc.), or to utilize amino acids like Pro that impose additional spatial constraints 
5 on peptide structure. An exemplary peptide consists of Domain I-II-IIl-derived 
elements: 37-54:90-104:144-162. 

Similarly, by analogy with the above-described cyclic domain structures, the 
fact that residues 37 and 162 are positioned near each other in the protein suggests 
that a cyclic tip mimetic could be created by replacing these residues with cysteines, 
10 for example, Cys::38-54:90-104: 144-161 ::Cys. 

4. Extended N-terminal Domains of BPI: The three dimensional structure of 
BPI indicates that the molecule, which forms N- and C-terminal domains, can be 
divided into three structural units as described in Example 2. Two of these units 
represent the N- and C-terminal "barrels" that are formed by residues 10-193 and 

15 260-421, respectively, whereas the third element is a central fi sheet structure that 
forms the interface between the two barrels. Of interest is the fact that the two bound 
phospholipids in BPI occupy spaces between the two barrel structures and the central 
6 sheet. As a result, the recombinant BPI protein product rBPI 2I . which essentially 
contains residues 1-193 lacks some of the structural components necessary to form 

20 a complete hydrophobic pocket. A new molecule is constructed that encodes residues 
1 to approximately 260 which would contain most of the residues necessary to form 
a complete hydrophobic pocket. 

5. Mutants for Immobilizing rBPI 21 : One application for BPI protein products 
is their use as affinity removal ligands for endotoxin in solution. For example, 

25 immobilizing a BPI protein on a column or membrane matrix would allow removal 
of endotoxin from endotoxin-contaminated solutions by simply passing those solutions 
over the immobilized BPI protein. Some of the cysteine-mutated peptides described 
above may be useful for this purpose, as well as rBPI 23 . Alternatively, in order to 
selectively couple a stable, readily produced protein like rBPI 2 i to a column or 

30 membrane, a cysteine could be added to the N- or C-terminus, thus allowing site- 
specific conjugation and selective orientation of the binding "tip" away from the solid 
support. Such a construct is alternatively constructed by adding a short linker 
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segment (such as Gly-Gly-Gly-Ser) to the C-terminal residue of the BPI protein 
product, for example, residue 193 of rBPl(l-193), followed by a cysteine residue. 
Such a construct would have a high probability of folding correctly, given the domain 
nature of residues 1-193, and be readily conjugatable. Similarly, a series of new 
5 conjugates between rBPl(l-193)C and other thiol-containing proteins or molecules is 
created for the purposes of evaluating new molecules. 

6. New N-terminal Dimeric Molecules: As an extension of the above analysis, 
a new series of N-terminal dimeric molecules can be constructed that take better 
advantage of the hydrophobic pockets. For example, by replacing the C-terminal 

10 barrel with another copy of the N-terminal barrel, an analog of BPI would be created 
that contains two functional barrels and possibly two functional hydrophobic pockets. 
One such dimer could be constructed by replacing residues 260-456 with residues 1- 
193. Alternatively, other more central locations may be identified within the 6 sheet 
structure where symmetry would dictate additional and even better points for 

15 duplication. 

7. C-terminal Fusion Proteins: The C-terminal domains of LBP and CETP 
appear to mediate interactions with CD 14 and lipoproteins. Similarly, the C-terminal 
domain of BPI has LPS binding and neutralization activities. As a consequence, the 
C-terminal barrel of BPI (or other family members) could be fused to barrels or 

20 domains of other family members and/or to other proteins to alter/modify/enhance 
their action. 

8. Homology modeling of BPI family members: The BPI coordinates have 
been useful in the generation of molecular models of other members of the BPI 
protein family. Utilizing CLUSTAL (a multiple sequence alignment program in PC- 

25 Gene) and the Homology module (a structure-based homology modeling program in 
Insightll on the SGI), molecular models (and the corresponding three-dimensional 
coordinates files) of Iipopoly saccharide binding protein (LBP), cholesteryl ester 
transfer protein (CETP) and phospholipid transfer protein (PLTP) have been 
generated. With these files, existing mutants are mapped and new ones designed. 

30 Published data [Wang et al., Biochemistry 50:3484-3490, (1991)] indicate that 
insertional alterations in three locations of CETP severely impaired cholesteryl ester 
transfer activity: residues 48-53, residue 165, and residues 373-379. Since residues 
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48-53 and residue 165 of CETP coincide structurally with Domains I and III of BPI 
respectively, the functional domain structure of BPI extends to the other protein 
family members. Similarly, by virtue of the symmetry between the N- and C- 
terminal domains, it is likely that the corresponding residues on the Oterminal tip of 
5 BPI are involved in recognizing receptors and/or interacting with lipoproteins. 

9. Lipid Pocket Mutants: A detailed compilation of the residues in BPI 
which form the pockets is described in Table 3 as follows. Column 1 of Table 3 
indicates the residue name and number. Column 2 shows checked residues which 
indicate the residues that show a change in solvent accessible surface area with lipid 

10 binding. Column 3 shows checked residues that indicate the residues that have some 
atom within 4 A of a lipid atom. If the contact is to the head-group of the lipid, the 
residues are listed at the end, under ENTRY ONLY. Column 4 indicates 
conservation in 3 BPI and 4 LBP sequences, e.g., for He 68, in 3 of the 7 sequences, 
the residue is similarly He; for the other 4 sequences, the residue is Leu (see also 

15 column 5). Column 5 indicates alternative residues which occur in BPI or LBP at 
that position for the 7 BPI and LBP sequences analyzed. Column 6 indicates residues 
for mutations to block the pockets, using residues selected to be well-conserved 
(especially in the N-terminal domain) and relatively small. The suggested mutations 
are all to large sidechains in order to decrease the size of the pocket by as much as 

20 possible. 



TABLE 3 



N-TERMINAL POCKET - residues contributing to interior 


Residue 1 


ASA 2 


Within 
4A 3 


Conserva- 
tion 4 


Alterna- 
tives 5 


Mutations 6 


Val 7 


V 


V 


2/7 


A,T 




He 9 


V 


V 


All 




W 


Gly 13 


V 


V 


All 






Leu 14 


V 




All 






Tyr 16 


V 




All 






Ala 17 


V 


V 


AH 




F 



WO* 98/58961 



63 



PCT/US98/13007 < 

> 





Ser 18 


V 


V 


1/7 


C,A 






Gly21 


V 


V 


All 




F 




Ala 24 


V 


V 


4/7 


V,T,S 


Y,H 




Leu 25 


V 




All 






5 


lie 68 


V 


V 


3/7 


L 






Leu 76 


V 


V 


All 








Phe78 


V 


V 


1/7 


V,L 






Leu 117 


V 


V 


All 








Leu 119 


V 




All 






10 


Pro 128 


V 




5/7 


A,S 






lie 130 


V 




1/7 


V 






Val 178 


V 




2/7 


L,l 






Val 182 


V 




All 








Glu 185 


V 


V 


1/7 (allele) 


K,D,H 




15 


Leu 186 


V 


V 


All 




W 




Tyr 189 


V 


V 


All 








Phe 190 


V 


V 


2/7 


V,L 






Leu 193 


V 


V 


All 








Leu 220 


V 


V 


All 






20 


Val 222 


V 




5/7 


M,W 






Val 254 


V 


V 


6/7 


I 






Leu 256 


V 




3/7 


F 






Pro 428 


V 


V 


All 








Thr429 


V 


V 


1/7 


M,L 




25 


Pro 430 


V 


V 


5/7 


L 






Val 433 


V 




3/7 


I 






Leu 435 


V 




All 








Val 453 


V 




6/7 


I 






ENTRY ONLY 


30 


Gin 20 


V . 


V 


3/7 


E 
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Lys 27 


V 


V 


3/7 


R,S 




Glu28 


V 




6/7 


K 




Arg31 


V 




2/7 


K,E 




Ser 181 


V 


V 


5/7 


T,A 




Arg 432 


V 


V 


3/7 


K.Y.H 




Tyr 455 


V 


V 


6/7 


H 




C-TERMINAL POCKET - residues contributing to interior 


Phe 263 


V 


V 


All 






Asn 264 


V 




All 






Ala 266 


V 


V 


All 




F 


Gly 267 


V 


V 


2/7 


A,S,T, 




Val 275 


V 


V 


1/7 


A,Y 




Leu 276 


V 


V 


5/7 


F,W 


F,W 


Lys 277 


V 




1/7 


G,N 




Met 278 


V 


V 


1/7 


L,F 




Val 318 


V 




1/7 


L.I.G 




Ala 320 


V 




111 


V 




Pro 324 


V 


V 


6/7 


Q 




Leu 326 


V 


V 


6/7 


V 




Phe 335 


V 


V 


1/7 


L.V.E 




Pro 337 


V 


V 


5/7 


A,F 




Val 339 


V 




211 


L,M 




Met 360 


V 


V 


2/7 


L,V 




Thr362 


V 


V 


5/7 


L 




Val 368 


V 


V 


in 


I,L 




Leu 375 


V 


V 


3/7 


i,v 




Val 376 


V 


V 


2/7 


I,T 




Gly 377 


V 




All 






Leu 379 


V 


V 


All 






Leu 381 


V 


V 


3/7 


P 
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Val 409 


v 


V 


1/7 


I MI 




Val 413 


v 


Y 


1/7 

ill 


F L 


p 

1 


Val 417 


V 


V 


3/7 


I,F 


W 


Lys 420 


V 


V 


5/7 


E 


Y,H 


Leu 421 


V 


V 


5/7 


I,F 




Phe 425 


V 


V 


6/7 


L 




ENTRY ONLY 


Asp 200 


V 




All 






Ser201 


V 


V 


4/7 


K.T.N 




Val 202 


V 




4/7 


F,I 




Tyr270 


V 


V 


All 






Arg 416 


V 


V 


1/7 


K,V,D 




Lys 423 


V 




3/7 


R.E.Q 





1 Residue name and number. 

15 2 Checked residues show a change in solvent accessible surface area with lipid 
binding. 

3 Checked residues have some atom within 4 A of a lipid atom (if the contact is 
to the head group of the lipid, the residues are listed at the end, under ENTRY 
ONLY). 

20 4 Conservation in 3 BP1 and 4 LBP sequences, e.g., for He 68, in 3 of the 7 
sequences, the residue is similarly He; for the other 4 sequences, the residue is 
Leu (see note 5). 

5 Indicates alternative residues which occur in BPI or LBP at that position for the 
7 BPI and LBP sequences analyzed. 

25 6 Indicates residues for mutations to block the pockets using residues selected to 
be well-conserved (especially in the N-terminal domain) and relatively small. 
The suggested mutations are all to large sidechains in order to decrease the size 
of the pocket by as much as possible. 
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TABLE 4 





ATOM 


1 


CB 


VAL 


A 


1 


95.942 


11. 


564 


18. 


390 


1. 


00 


51.23 




ATOM 


2 


CGI 


VAL 


A 


1 


97.217 


12. 


313 


18. 


805 


1. 


00 


48.90 


J 


a TOM 


3 


CG2 


VAL 


A 


1 


94.694 


12 . 


383 


18 . 


720 


1 . 


00 


50.67 




ATOM 


4 


C 


VAL 


A 


1 


96.124 


12. 


502 


16. 


083 


i. 


00 


47.46 


10 


ATOM 


5 


0 


VAL 


A 


1 


97.238 


12. 


990 


15. 


911 


l. 


00 


49.54 




ATOM 


8 


N 


VAL 


A. 


1 


97.161 


10. 


372 


16. 


554 


i. 


00 


51.03 




ATOM 


10 


CA 


VAL 


A 


1 


95.982 


11. 


216 


16. 


876 


1. 


00 


48.79 


































AxUM 


i i 

X J- 


TJ 
vt 


ASN 




2 


95. 020 


13 . 


016 








00 


42 . 80 




ATOM 


13 


CA 


ASN 


A 


2 


95.067 


14. 


266 


14. 


810 


l. 


00 


38.18 


20 


ATOM 


14 


CB 


ASN 


A 


2 


93.709 


14. 


566 


14. 


185 


i. 


00 


39.01 




ATOM . 


15 


CG 


ASN 


A 


2 


93.494 


13. 


831 


12. 


892 


i. 


00 


40.06 




ATOM 


16 


OD1 


ASN 


A 


2 


94 .323 


13 . 


020 


12. 


487 


l. 


00 


42.94 


25 
































ATOM 


17 


ND2 


ASN 


A 


2 


o o n 




108 


12 . 


228 


i . 


A A 

uu 


*5 A 4 Q 

35? . 4y 




ATOM 


20 


C 


ASN 


A 


2 


95.439 


15. 


376 


15. 


786 


i. 


00 


34.97 


30 


ATOM 


21 


O 


ASN 


A 


2 


94.946 


15 . 


399 


16. 


916 


i. 


00 


37.36 




ATOM 


22 


N 


PRO 


A 


3 


96.373 


16. 


260 


15. 


397 


i. 


00 


30.80 




ATOM 


23 


CD 


PRO 


A 


3 


97.144 


16. 


.244 


14 . 


143 


i. 


00 


28.90 


35 
































ATOM 


24 


CA 


FKU 


A 


j 


Q/r one 


X / . 


Jo / 


lO . 


*5 C *> 
. 2D2 




rift 

. uu 


^o.xy 




ATOM 


25 


CB 


PRO 


A 


3 


98.083 


17, 


.830 


15. 


571 


1. 


00 


26.30 


40 


ATOM 


26 


CG 


PRO 


A 


3 


97.765 


17, 


,619 


14, 


.128 


1, 


,00 


26.17 




ATOM 


27 


C 


PRO 


A 


3 


95.765 


18, 


.482 


16, 


.283 


1. 


00 


28.00 




ATOM 


28 


O 


PRO 


A 


3 


95.104 


18 


.758 


15 


.275 


1 


.00 


28.89 


A C 

45 
































ATOM 


29 


N 




JV 

j\ . 


A 






. xuo 




AAA 


X . 


on 


C D ■ O X 




ATOM 


31 


CA 


GLY 


A 


4 


94.653 


20 


.182 


17 


.588 


1 


.00 


25.44 


50 


ATOM ' 


32 


C 


GLY 


A 


4 


95.178 


21 


.508 


17 


.091 


1 


.00 


25.34 




ATOM 


33 


0 


GLY 


A 


4 


94.407 


22 


.425 


16 


.831 


1 


.00 


28.37 




ATOM 


34 


N 


VAL 


A 


5 


96.494 


21 


.621 


16 


.981 


1 


.00 


23.98 




































-> o 


PA 


VAL 


A 


5 


97 . 134 


22 


. 842 


16 


.519 


1 


.00 


21 . 80 




ATOM 


37 


CB 


VAL 


A 


5 


97.671 


23 


.681 


17 


.689 


1 


.00 


18.63 


60 


ATOM 


38 


CGI 


VAL 


A 


5 


98.573 


24 


.785 


17 


.171 


1 


.00 


20.04 




ATOM 


39 


CG2 


VAL 


A 


5 


96.526 


24 


.283 


18 


.468 


1 


.00 


20.89 




ATOM 


40 


C 


VAL 


A 


5 


98.308 


22 


.436 


15 


.660 


1 


.00 


24 .04 


65 
































ATOM 


41 


o 


VAL 


A 


5 


99.014 


21 


.486 


15 


.984 


1 


.00 


27.65 




ATOM 


42 


N 


VAL 


A 


6 


98.492 


23 


.139 


14 


.551 


1 


.00 


25.94 
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ATOM 


44 


CA 


VAL A 


6 


99.589 


22.874 


13.633 


1. 


00 


26. 


54 


ATOM 


45 


CB 


VAL A 


6 


99.082 


22.292 


12.294 


1. 


00 


29. 


26 


ATOM 


46 


CGI 


VAL A 


6 


100.253 


22.038 


11.357 


1. 


00 


31. 


93 


ATOM 


47 


CG2 


VAL A 


6 


98.318 


21.001 


12.535 


1. 


00 


30. 


52 


ATOM 


48 


C 


VAL A 


6 


100.289 


24.191 


13.355 


1. 


00 


24. 


99 


ATOM 


49 


O 


VAL A 


6 


99.63B 


25.202 


13.135 


1. 


00 


26. 


97 


ATOM 


50 


N 


VAL A 


7 


101.611 


24.183 


13.416 


1. 


00 


26. 


91 


ATOM 


52 


CA 


VAL A 


7 


102.404 


25.371 


13.156 


1. 


00 


27. 


18 


ATOM 


53 


CB 


VAL A 


7 


103.401 


25.637 


14.298 


1. 


00 


27. 


59 ' 


ATOM 


54 


CGI 


VAL A 


7 


104.343 


26.773 


13.925 


1. 


00 


27. 


21 


ATOM 


55 


CG2 


VAL A 


7 


102.659 


25.967 


15.575 


1. 


00 


27. 


05 


ATOM 


56 


C 


VAL A 


7 


103.190 


25.103 


11.889 


1. 


00 


29. 


61 


ATOM 


57 


0 


VAL A 


7 


103.863 


24.084 


11.777 


1. 


00 


32. 


24 


ATOM 


58 


N 


ARG A 


8 


103.077 


25.994 


10.917 


1. 


00 


33. 


04 


ATOM 


60 


CA 


ARG A 


8 


103.801 


25.841 


9.668 


1. 


00 


33. 


25 


ATOM 


61 


CB 


ARG A 


8 


102.842 


25.631 


8.494 


1, 


00 


35. 


70 


ATOM 


62 


CG 


ARG A 


8 


102.195 


24.244 


8.485 


1. 


00 


41. 


64 


ATOM 


63 


CD 


ARG A 


8 


101.309 


24.004 


7.259 


1. 


00 


44. 


24 


ATOM 


64 


NE 


ARG A 


8 


102.049 


23 .578 


6.073 


1, 


,00 


46. 


.00 


ATOM 


66 


CZ 


ARG A 


8 


101.565 


23.626 


4.835 


1, 


,00 


51, 


.86 


ATOM 


67 


NH1 


ARG A 


8 


100.341 


24.091 


4.611 


1. 


.00 


52 


.72 


ATOM 


70 


NH2 


ARG A 


8 


102.293 


23 .177 


3.820 


1. 


.00 


53 


.10 


ATOM 


73 


C 


ARG A 


8 


104.682 


27.052 


9.441 


1 


.00 


32 


.06 


ATOM 


74 


0 


ARG A 


8 


104.196 


28.168 


9.288 


1 


.00 


32 


.62 


ATOM 


75 


N 


ILE A 


9 


105.986 


26.830 


9.515 


1 


.00 


31 


.21 


ATOM 


77 


CA 


ILE A 


9 


106.955 


27.888 


9.315 


1 


.00 


31 


.97 


ATOM 


78 


CB 


ILK A 


9 


108.210 


27.633 


10.167 


1 


.00 


32 


.76 


ATOM 


79 


CG2 


ILE A 


9 


109.208 


28.781 


10.012 


1 


.00 


29 


.90 


ATOM 


80 


CGI 


ILE A 


9 


107.803 


27.454 


11.633 


1 


.00 


32 


.76 


ATOM 


81 


CD1 


ILE A 


9 


108.945 


27.087 


12.554 


1 


.00 


36 


.13 


ATOM 


82 


C 


ILE A 


9 


107.309 


27.889 


7.832 


1 


.00 


34 


.24 


ATOM 


83 


0 


ILE A 


9 


107.650 


26.845 


7.265 


1 


.00 


37 


.01 


ATOM 


84 


N 


SER A 


10 


107.179 


29.045 


7.195 


1 


.00 


34 


.50 
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ATOM 


86 


CA 


SER 


A 


10 


107. 


473 


29. 


173 


5. 


774 


1. 


00 


33. 


98 


ATOM 


87 


CB 


SER 


A 


10 


106. 


536 


30. 


200 


5. 


146 


1. 


00 


33. 


86 


ATOM 


88 


OG 


SER 


A 


10 


106. 


778 


31. 


488 


5. 


679 


1. 


00 


36. 


26 


ATOM 


90 


C 


SER 


A 


10 


108. 


922 


29. 


563 


5. 


500 


1. 


00 


32. 


91 


ATOM 


91 


0 


SER 


A 


10 


109. 


674 


29. 


866 


6. 


416 


1. 


00 


33. 


57 


ATOM 


92 


N 


GLN 


A 


11 


109. 


300 


29. 


550 


4. 


225 


1. 


00 


34. 


16 


ATOM 


94 


CA 


GLN 


A 


11 


110. 


653 


29. 


912 


3. 


811 


1. 


00 


35. 


54 


ATOM 


95 


CB 


GLN 


A 


11 


110. 


792 


29. 


845 


2. 


283 


1. 


00 


35. 


39 


ATOM 


96 


CG 


GLN 


A 


11 


112. 


212 


30. 


070 


1. 


746 


1. 


00 


36. 


23 


atoSt- 


97 


CD 


GLN 


A 


11 


113. 


164 


28. 


920 


2. 


040 


1. 


00 


35. 


94 


ATOM 


98 


OEl 


GLN 


A 


11 


112. 


880 


27. 


774 


1. 


716 


1. 


00 


38. 


97 


ATOM 


99 


NE2 


GLN 


A 


11 


114. 


310 


29. 


230 


2. 


624 


1. 


00 


35. 


28 


ATOM 


102 


C 


GLN 


A 


11 


110. 


991 


31. 


312 


4. 


316 


1. 


00 


36. 


60 


ATOM 


103 


0 


GLN 


A 


11 


112. 


116 


31. 


559 


4. 


740 


1. 


00 


41. 


10 


ATOM 


104 


N 


LYS 


A 


12 


110. 


013 


32. 


216 


4. 


305 


1. 


00 


35. 


51 


ATOM 


106 


CA 


LYS 


A 


12 


110. 


245 


33. 


573 


4 . 


788 


1. 


00 


31. 


96 


ATOM 


107 


CB 


LYS 


A 


12 


109. 


,005 


34. 


445 


4. 


612 


1. 


00 


32. 


24 


ATOM 


108 


CG 


LYS 


A 


12 


109. 


226 


35. 


876 


5. 


065 


1. 


00 


32. 


71 


ATOM 


109 


CD 


LYS 


A 


12 


107. 


953 


36. 


685 


5. 


.063 


1. 


00 


31. 


87 


ATOM 


110 


CE 


LYS 


A 


12 


108, 


.227 


38. 


085 


5. 


.573 


1. 


00 


34. 


67 


ATOM 


111 


N2 


LYS 


A 


12 


106. 


.998 


38. 


914 


5. 


.627 


1. 


00 


37. 


88 


ATOM 


115 


C 


LYS 


A 


12 


110. 


,612 


33 . 


498 


6, 


.259 


1. 


00 


30. 


93 


ATOM 


116 


0 


LYS 


A 


12 


Ill- 


.482 


34. 


.227 


6, 


.728 


1. 


.00 


32, 


.37 


ATOM 


117 


N 


GLY 


A 


13 


109, 


,961 


32, 


.586 


6. 


.973 


1. 


.00 


30. 


47 


ATOM 


119 


CA 


GLY 


A 


13 


110, 


.231 


32, 


.407 


8. 


.385 


1- 


.00 


27. 


.64 


ATOM 


120 


C 


GLY 


A 


13 


111 


.593 


31 


.791 


8 


.610 


1 


.00 


29. 


.49 


ATOM 


121 


0 


GLY 


A 


13 


112 


.312 


32 


.184 


9 


.531 


1 


.00 


30. 


.35 


ATOM 


122 


N 


LEU 


A 


14 


111 


.952 


30 


.817 


7 


.779 


1 


.00 


28 


.87 


ATOM 


124 


CA 


LEU 


A 


14 


113 


.248 


30 


.170 


7 


.912 


1 


.00 


28 


.53 


ATOM 


125 


CB 


LEU 


A 


14 


113 


.361 


28 


.941 


7 


.011 


1 


.00 


25 


.60 


ATOM 


126 


CG 


LEU 


A 


14 


112 


.564 


27 


.698 


7 


.419 


1 


.00 


25 


.99 


ATOM 


127 


CD1 


LEU 


A 


14 


112 


.959 


26 


.547 


6 


.531 


1 


.00 


28 


.34 


ATOM 


128 


CD2 


LEU 


A 


14 


112 


.829 


27 


.330 


8 


.855 


1 


.00 


23 


.90 


ATOM 


129 


C 


LEU 


A 


14 


114 


.359 


31 


.155 


7 


.613 


1 


.00 


28 


.63 
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ATOM 


130 


O 


LEU A 


14 


115. 


346 


31. 


207 


8.335 


1. 


00 


31. 


86 


ATOM 


131 


N 


ASP A 


15 


114. 


176 


31. 


972 


6.582 


1. 


00 


31. 


30 


ATOM 


133 


CA 


ASP A 


15 


115. 


176 


32. 


975 


6.215 


1. 


00 


33. 


98 


ATOM 


134 


CB 


ASP A 


15 


114. 


695 


33. 


832 


5.033 


1. 


00 


36. 


03 


ATOM 


135 


CG 


ASP A 


15 


114. 


818 


33. 


127 


3.681 


1. 


00 


36. 


52 


ATOM 


136 


OD1 


ASP A 


15 


115. 


662 


32. 


209 


3.515 


1. 


00 


37. 


01 


ATOM 


137 


OD2 


ASP A 


15 


114. 


071 


33. 


530 


2.765 


1. 


00 


36. 


25 
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11.890 
11.773 
10.576 
12.884 
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12.844 
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13 .511 
14.065 
13 .334 
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1.00 48.91 
1.00 53.18 
1.00 52.27 
1.00 39.36 
1.00 40.86 
1.00 38.66 
l.OO 35.83 
1.00 34.68 
1.00 34.59 
1.00 35.67 
1.00 35.88 
1.00 35.13 
1.00 35.15 
1.00 35.41 
1.00 34.03 
1.00 36.61 
1.00 41.84 
1.00 45.83 
1.00 49.70 
1.00 52.85 
1.00 32.08 
1.00 30.72 
1.00 34.05 
1.00 33.03 
1.00 30.64 
1.00 26.32 
1.00 27.43 
1.00 29.44 
1.00 33.62 
1.00 32.69 
1.00 33.78 
1.00 33.21 
1.00 34.60 
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ATOM 2387 CB TYR A 255 100.726 29.448 8.784 1.00 32.98* 

ATOM 2388 CG TYR A 255 100.746 30.482 7.694 1.00 32.36 

ATOM 2389 CD1 TYR A 255 101.913 30.756 6.987 1.00 31.99 

ATOM 2390 CE1 TYR A 255 101.921 31.674 5.947 1.00 33.10 

ATOM 2391 CD2 TYR A 255 99.587 31.154 7.335 1.00 31.03 

ATOM 2392 CE2 TYR A 255 99.583 32.072 6.298 1.00 32.21 

ATOM 2393 CZ TYR A 255 100.750 32.325 5.607 1.00 33.29 

ATOM 2394 OH TYR A 255 100.741 33.230 4.573 1.00 38.59 

ATOM 2396 C TYR A 255 100.559 28.955 11.195 1.00 31.73 

ATOM 2397 O TYR A 255 101.156 27.892 11.330 1.00 32.57 

ATOM 2398 N LEU A 256 99.490 29.279 11.914 1.00 31.36 

ATOM 2400 CA LEU A 256 98.916 28.370 12.893 1-00 29.13 

ATOM 2401 CB LEU A 256 98.878 28.992 14.280 1.00 28.40 

ATOM 2402 CG LEU A 256 100.206 29.125 15.014 1.00 28.89 

ATOM 2403 CD1 LEU A 256 101.045 30.208 14.375 1.00 32.20 

ATOM 2404 CD2 LEU A 256 99.938 29.462 16,471 1.00 32.12 

ATOM 2405 C LEU A 256 97.509 27.984 12.489 1.00 30.86 

ATOM 2406 O LEU A 256 96.748 28.814 11.987 1.00 30.77 

ATOM 2407 N GLY A 257 97.203 26.701 12.652 1.00 31.62 

ATOM 2409 CA GLY A 257 95.888 26.180 12.345 1.00 29.36 

ATOM 2410 C GLY A 257 95.287 25.827 13.688 1.00 29.99 

ATOM 2411 O GLY A 257 95.662 24.830 14.295 1.00 32.08 

ATOM 2412 N LEU A 258 94.402 26.686 14.177 1.00 29.99 

ATOM 2414 CA LEU A 258 ' 93.742 26.507 15.463 1.00 26.58 

ATOM 2415 CB LEU A 258 93.562 27.871 16.101 1.00 26.90 

ATOM 2416 CG LEU A 258 94.866 28.668 16.049 1.00 28.57 

ATOM 2417 CD1 LEU A 258 94.583 30.144 16.176 1.00 29.14 

ATOM 2418 CD2 LEU A 258 95.808 28.177 17.139 1.00 29.78 

ATOM 2419 C LEU A 258 92.397 25.838 15.253 1.00 28.27 

ATOM 2420 O LEU A 258 91.404 26.499 14.936 1.00 29.90 

• ATOM 2421 N SER A 259 92.370 24.523 15.434 1.00 28.81 

ATOM 2423 CA SER A 259 91.164 23.726 15.229 1.00 30.63 

ATOM 2424 CB SER A 259 91.491 22.237 15.348 1.00 31.03 

ATOM 2425 OG SER A 259 91.783 21.889 16.689 1.00 33.43 
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A 922 
A 923 
A 924 
A 925 
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A 940 
A 941 
A 942 
A 943 
A 944 
A 945 
A 946 
A 947 
A 948 



80.378 
80.215 
79.054 
65.692 
88.914 
100.082 
76 .929 
171.524 
108.006 
141.049 
110.883 
74.360 
68.751 
59.758 
95.173 
111.623 
105.604 
97.160 
108.492 
135.408 
91.469 
164.580 
157.888 
121.776 
96.503 
96.403 
88.114 
81.217 
126.713 
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23 .460 
20.293 
8 .640 
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81.97 
29.12 
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65.22 
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86.93 
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49.31 
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35.61 
75.14 
43.22 
61.69 
48.05 
53.02 
71.57 
58.62 
76.16 
61.58 
41.59 
48.97 
63.61 
61.93 
64.71 
76.51 
52.33 
35.93 
82.50 
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REFINEMENT . 
PROGRAM 
AUTHORS 



X-PLOR (online) 
BRUNGER . 



3.843 



DATA USED IN REFINEMENT. 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF (SIGMA (F) ) 

DATA CUTOFF HIGH (ABS (F) ) 

DATA CUTOFF LOW (ABS (F) ) 

COMPLETENESS (WORKING+TEST ) (%) 
NUMBER OF REFLECTIONS 

FIT TO DATA USED IN REFINEMENT. 
CROSS-VALIDATION METHOD 
FREE R VALUE TEST SET SELECTION 
R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE (%) 
FREE R VALUE TEST SET COUNT 
ESTIMATED ERROR OF FREE R VALUE 



2.40 
50.00 
0.0 

100000.00 
0.010000 
92.7 
18908 



THROUGHOUT 

RANDOM 

0.225 

0 .295 

10.2 

1926 
0.007 



FIT IN THE HIGHEST RESOLUTION BIN. 
TOTAL NUMBER OF BINS USED 
BIN RESOLUTION RANGE HIGH (A) 
BIN RESOLUTION RANGE LOW (A) 
BIN COMPLETENESS (WORKING+TEST) {%) 
REFLECTIONS IN BIN (WORKING SET) 

BIN R VALUE (WORKING SET) 

BIN FREE R VALUE 

BIN FREE R VALUE TEST SET SIZE (%) 
BIN FREE R VALUE TEST SET COUNT 
ESTIMATED ERROR OF BIN FREE R VALUE 



6 

2.40 
2.55 
94.1 

2875 
0.368 
0.455 

8.7 
275 
0.027 



NUMBER OF NON -HYDROGEN ATOMS USED IN REFINEMENT. 



PROTEIN ATOMS 
NUCLEIC ACID ATOMS 
HETEROGEN ATOMS 
SOLVENT ATOMS 



3531 
0 

102 
48 



B VALUES. 
FROM WILSON PLOT (A**2) 
MEAN B VALUE (OVERALL, A**2) 

OVERALL ANISOTROPIC B VALUE. 



45.0 
45.0 



Bll 
B22 
B33 
B12 
B13 
B23 



<A**2) 
(A**2) 
(A**2) 
(A**2) 
(A**2) 
<A**2) 



-11 
6 



35 
33 
5.02 
0.00 
7.70 
0.00 



ESTIMATED COORDINATE ERROR. 
ESD FROM LUZ2ATI PLOT 
ESD FROM SIGMAA 
LOW RESOLUTION CUTOFF 



(A) 
(A) 
(A) 



0.34 
0.47 
5.00 



CROSS -VALIDATED ESTIMATED COORDINATE ERROR. 

ESD FROM C-V LUZZATI PLOT (A) : 0.45 

ESD FROM C-V SIGMAA (A) : 0.46 

RMS DEVIATIONS FROM IDEAL VALUES. 

BOND LENGTHS (A) : 0.007 

BOND ANGLES (DEGREES) : 1.4 

DIHEDRAL ANGLES (DEGREES) : 26.1 

IMPROPER ANGLES (DEGREES) : 1.24 
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SEQRES 
SEQRES 
SEQRES 
SEQRES 
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SEQRES 
SEQRES 
SEQRES 
SEQRES 
SSBOND 
CRYST1 
ORIGX1 
ORIGX2 
ORIGX3 
SCALE1 
SCALE2 
SCALE 3 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



ISOTROPIC THERMAL MODEL : RESTRAINED 



ISOTROPIC THERMAL FACTOR 
MAIN-CHAIN BOND 
MAIN- CHAIN ANGLE 
SIDE- CHAIN BOND 
SIDE -CHAIN ANGLE 

NCS MODEL 

NCS RESTRAINTS. 
GROUP 1 POSITIONAL 
GROUP 1 B- FACTOR 



RESTRAINTS . 
<A**2) 
(A**2) 
(A**2) 
(A**2) 

NONE 



(A) 
<A**2) 



RMS 


SIGMA 


1.77 ; 


1.50 


3.00 ; 


2.00 - 


2.81 ; 


2.00 


4.53 ; 


2.50 



RMS SIGMA/ WEIGHT 
NULL ; NULL 
NULL ; NULL 



PARAMETER FILE 1 
PARAMETER FILE 2 
TOPOLOGY FILE 1 
TOPOLOGY FILE 2 



parhcsdx.pro 
paramll . wat 
tophcsdx.pro 
tophll . wat 
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The following abbreviations are used in Table 4 in accordance with the format 
and usage established by the Protein Data Bank ("PDB"), Brookhaven National 
Laboratory, Brookhaven, N.Y. These coordinates are on deposit with the PDB, ID 
code lbpl. Atomic coordinates of a BPI protein as described herein appear at Table 
5 4 (pages 62-171 herein) and refinement statistics also appear at the end of Table 4 
(pages 172-173 herein). Table 4 corresponds to Figure 6 (Figs. 6.1-6.112) in United 
States patent application serial no. 08/879,565, filed June 20, 1997. 

"Atom type" refers to the element whose coordinates are measured. The first 
letter in the column defines the element. 
10 "Residue* refers to the amino acid in the BPI protein sequence, using the 

standard three letter abbreviations known in the art. 
refers to the residue number. 
n X, Y, Z n crystallographically define the atomic position, in three-dimensional 
space, of the element measured. 
15 "OCC" is the occupancy value. 

"B" is a thermal factor that measures movement of the atom around its atomic 
center. 

10. Organ omimetics 

Molecular modelling of BPI as described herein is useful for the preparation of 

20 organomimetics such as "surface" mimetics. As one example, organomimetics are 
prepared based on "tip" mimetics in which the three-dimensional coordinates of the 
tip, as described above, are used to create a "surface" (or complementary pocket) into 
which a computer program builds an organic molecule with similar characteristics. 
Numerous modifications and variations in the practice of the invention are 

25 expected to occur to those skilled in the art upon consideration of the foregoing 
description and the presently preferred embodiments thereof. Consequently, the 
scope of the present invention is to be defined by the appended claims. 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 31.. 1491 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 
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(B) LOCATION: 124.. 1491 

(ix) FEATURE : 

(A) NAME/KEY: misc_feature 

(D) OTHER INFORMATION: "rBPI" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CAGGCCTTGA GGTTTTGGCA GCTCTGGAGG ATG AGA GAG AAC ATG GCC AGG GGC 54 

Met Arg Glu Asn Met Ala Arg Gly 
-31 -30 -25 

CCT TGC AAC GCG CCG AGA TGG GTG TCC CTG ATG GTG CTC GTC GCC ATA 102 
Pro Cys Asn Ala Pro Arg Trp Val Ser Leu Met Val Leu Val Ala lie 
-20 -15 -10 

GGC ACC GCC GTG ACA GCG GCC GTC AAC CCT GGC GTC GTG GTC AGG ATC 150 
Gly Thr Ala Val Thr Ala Ala Val Asn Pro Gly Val Val Val Arg lie 
-5 15 

TCC CAG AAG GGC CTG GAC TAC GCC AGC CAG CAG GGG ACG GCC GCT CTG 198 
Ser Gin Lys Gly Leu Asp Tyr Ala Ser Gin Gin Gly Thr Ala Ala Leu 
10 15 20 25 

CAG AAG GAG CTG AAG AGG ATC AAG ATT CCT GAC TAC TCA GAC AGC TTT 246 
Gin Lys Glu Leu Lys Arg lie Lys lie Pro Asp Tyr Ser Asp Ser Phe 
30 35 40 

AAG ATC AAG CAT CTT GGG AAG GGG CAT TAT AGC TTC TAC AGC ATG GAC 294 
Lys lie Lys His Leu Gly Lys Gly His Tyr Ser Phe Tyr Ser Met Asp 
45 50 55 

ATC CGT GAA TTC CAG CTT CCC AGT TCC CAG ATA AGC ATG GTG CCC AAT 342 
lie Arg Glu Phe Gin Leu Pro Ser Ser Gin lie Ser Met Val Pro Asn 
60 65 70 

GTG GGC CTT AAG TTC TCC ATC AGC AAC GCC AAT ATC AAG ATC AGC GGG 390 
Val Gly Leu Lys Phe Ser lie Ser Asn Ala Asn lie Lys He Ser Gly 
75 80 85 

AAA TGG AAG GCA CAA AAG AGA TTC TTA AAA ATG AGC GGC AAT TTT GAC 438 
Lys Trp Lys Ala Gin Lys Arg Phe Leu Lys Met Ser Gly Asn Phe Asp 
90 95 100 105 

CTG AGC ATA GAA GGC ATG TCC ATT TCG GCT GAT CTG AAG CTG GGC AGT 486 
Leu Ser He Glu Gly Met Ser He Ser Ala Asp Leu Lys Leu Gly Ser 
110 115 120 

AAC CCC ACG TCA GGC AAG CCC ACC ATC ACC TGC TCC AGC TGC AGC AGC 534 
Asn Pro Thr Ser Gly Lys Pro Thr He Thr Cys Ser Ser Cys Ser Ser 
125 130 135 

CAC ATC AAC AGT GTC CAC GTG CAC ATC TCA AAG AGC AAA GTC GGG TGG 582 
His He Asn Ser Val His Val His He Ser Lys Ser Lys Val Gly Trp 
140 145 150 

CTG ATC CAA CTC TTC CAC AAA AAA ATT GAG TCT GCG CTT CGA AAC AAG 630 
Leu He Gin Leu Phe His Lys Lys He Glu Ser Ala Leu Arg Asn Lys 
155 160 165 

ATG AAC AGC CAG GTC TGC GAG AAA GTG ACC AAT TCT GTA TCC TCC AAG 678 
Met Asn Ser Gin Val Cys Glu Lys Val Thr Asn Ser Val Ser Ser Lys 
170 175 180 185 



CTG CAA CCT TAT TTC CAG ACT CTG CCA GTA ATG ACC AAA ATA GAT TCT 
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Leu Gin Pro Tyr Phe Gin Thr Leu Pro Val Met Thr Lys lie Asp Ser 
190 195 200 

GTG GCT GGA ATC AAC TAT GGT CTG GTG GCA CCT CCA GCA ACC ACG GCT 774 
Val Ala Gly lie Asn Tyr Gly Leu Val Ala Pro Pro Ala Thr Thr Ala 
205 210 215 

GAG ACC CTG GAT GTA CAG ATG AAG GGG GAG TTT TAC AGT GAG AAC CAC 822 
Glu Thr Leu Asp Val Gin Met Lys Gly Glu Phe Tyr Ser Glu Asn His 
220 225 230 

CAC AAT CCA CCT CCC TTT GCT CCA CCA GTG ATG GAG TTT CCC GCT GCC 870 
His Asn Pro Pro Pro Phe Ala Pro Pro Val Met Glu Phe Pro -Ala Ala 
235 240 245 

CAT GAC CGC ATG GTA TAC CTG GGC CTC TCA GAC TAC TTC TTC AAC ACA 918 
His Asp Arg Met Val Tyr Leu Gly Leu Ser Asp Tyr Phe Phe Asn Thr 
250 255 260 265 

GCC GGG CTT GTA TAC CAA GAG GCT GGG GTC TTG AAG ATG ACC CTT AGA 966 
Ala Gly Leu Val Tyr Gin Glu Ala Gly Val Leu Lys Met Thr Leu Arg 
270 275 280 

GAT GAC ATG ATT CCA AAG GAG TCC AAA TTT CGA CTG ACA ACC AAG TTC 1014 
Asp Asp Met He Pro Lys Glu Ser Lys Phe Arg Leu Thr Thr Lys Phe 
285 290 295 

TTT GGA ACC TTC CTA CCT GAG GTG GCC AAG AAG TTT CCC AAC ATG AAG 1062 
Phe Gly Thr Phe Leu Pro Glu Val Ala Lys Lys Phe Pro Asn Met Lys 
300 305 310 

ATA CAG ATC CAT GTC TCA GCC TCC ACC CCG CCA CAC CTG TCT GTG CAG 1110 
He Gin He His Val Ser Ala Ser Thr Pro Pro His Leu Ser Val Gin 
315 320 325 

CCC ACC GGC CTT ACC TTC TAC CCT GCC GTG GAT GTC CAG GCC TTT GCC 1158 
Pro Thr Gly Leu Thr Phe Tyr Pro Ala Val Asp Val Gin Ala Phe Ala 
330 335 340 345 

GTC CTC CCC AAC TCC TCC CTG GCT TCC CTC TTC CTG ATT GGC ATG CAC 1206 
Val lieu Pro Asn Ser Ser Leu Ala Ser Leu Phe Leu He Gly Met His 
350 355 360 

ACA ACT GGT TCC ATG GAG GTC AGC GCC GAG TCC AAC AGG CTT GTT GGA 1254 
Thr Thr Gly Ser Met Glu Val Ser Ala Glu Ser Asn Arg Leu Val Gly 
365 370 375 

GAG CTC AAG CTG GAT AGG CTG CTC CTG GAA CTG AAG CAC TCA AAT ATT 1302 
Glu Leu Lys Leu Asp Arg Leu Leu Leu Glu Leu Lys His Ser Asn He 
380 385 390 

GGC CCC TTC CCG GTT GAA TTG CTG CAG GAT ATC ATG AAC TAC ATT GTA 1350 
Gly Pro Phe Pro Val Glu Leu Leu Gin Asp He Met Asn Tyr He Val 
395 400 405 

CCC ATT CTT GTG CTG CCC AGG GTT AAC GAG AAA CTA CAG AAA GGC TTC 1398 
Pro He Leu Val Leu Pro Arg Val Asn Glu Lys Leu Gin Lys Gly Phe 
410 415 420 425 

CCT CTC CCG ACG CCG GCC AGA GTC CAG CTC TAC AAC GTA GTG CTT CAG 144 6 

Pro Leu Pro Thr Pro Ala Arg Val Gin Leu Tyr Asn Val Val Leu Gin 
430 435 440 

CCT CAC CAG AAC TTC CTG CTG TTC GGT GCA GAC GTT GTC TAT AAA 1491 
Pro His Gin Asn Phe Leu Leu Phe Gly Ala Asp Val Val Tyr Lys 
445 450 455 
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TGAAGGCACC AGGGGTGCCG GGGGCTGTCA GCCGCACCTG TTCCTGATGG GCTGTGGGGC 1551 

ACCGGCTGCC TTTCCCCAGG GAATCCTCTC CAGATCTTAA CCAAGAGCCC CTTGCAAACT 1611 

TCTTCGACTC AGATTCAGAA ATGATCTAAA CACGAGGAAA CATTATTCAT TGGAAAAGTG 1671 

CATGGTGTGT ATTTTAGGGA TTATGAGCTT CTTTCAAGGG CTAAGGCTGC AGAGATATTT 1731 

CCTCCAGGAA TCGTGTTTCA ATTGTAACCA AGAAATTTCC ATTTGTGCTT CATGAAAAAA 1791 

AACTTCTGGT TTTTTTCATG TG 1813 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 87 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Glu Asn Met Ala Arg Gly Pro Cys Asn Ala Pro Arg Tip Val 

-31 -30 -25 -20 

Ser Leu Met Val Leu Val Ala lie Gly Thr Ala Val Thr Ala Ala Val 

-15 -10 -5 1 

Asn Pro Gly Val Val Val Arg lie Ser Gin Lys Gly Leu Asp Tyr Ala 
5 10 15 

Ser Gin Gin Gly Thr Ala Ala Leu Gin Lys Glu Leu Lys Arg lie Lys 
20 25 30 

lie Pro Asp Tyr Ser Asp Ser Phe Lys lie Lys His Leu Gly Lys Gly . 
35 40 45 

His Tyr Ser Phe Tyr Ser Met Asp lie Arg Glu Phe Gin Leu Pro Ser 
50 55 60 65 

Ser Gin lie Ser Met Val Pro Asn Val Gly Leu Lys Phe Ser lie Ser 
70 75 80 

Asn Ala Asn lie Lys lie Ser Gly Lys Trp Lys Ala Gin Lys Arg Phe 
85 90 95 

Leu Lys Met Ser Gly Asn Phe Asp Leu Ser lie Glu Gly Met Ser lie 
100 105 110 

Ser Ala Asp Leu Lys Leu Gly Ser Asn Pro Thr Ser Gly Lys Pro Thr 
115 120 125 

lie Thr Cys Ser Ser Cys Ser Ser His lie Asn Ser Val His Val His 
130 135 140 145 

lie Ser Lys Ser Lys Val Gly Trp Leu lie Gin Leu Phe His Lys Lys 
150 155 160 

lie Glu Ser Ala Leu Arg Asn Lys Met Asn Ser Gin Val Cys Glu Lys 
165 170 175 

Val Thr Asn Ser Val Ser Ser Lys Leu Gin Pro Tyr Phe Gin Thr Leu 
180 185 190 

Pro Val Met Thr Lys lie Asp Ser Val Ala Gly lie Asn Tyr Gly Leu 
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195 200 205 

Val Ala Pro Pro Ala Thr Thr Ala Glu Thr Leu Asp Val Gin Met Lys 
210 215 220 225 

Gly Glu Phe Tyr Ser Glu Asn His His Asn Pro Pro Pro Phe Ala Pro 
230 235 240 

Pro Val Met Glu Phe Pro Ala Ala His Asp Arg Met Val Tyr Leu Gly 
245 250 255 

Leu Ser Asp Tyr Phe Phe Asn Thr Ala Gly Leu Val Tyr Gin Glu Ala 
260 265 .270 

Gly Val Leu Lys Met Thr Leu Arg Asp Asp Met He Pro Lys Glu Ser 
275 280 285 

Lys Phe Arg Leu Thr Thr Lys Phe Phe Gly Thr Phe Leu Pro Glu Val 
290 295 300 305 

Ala Lys Lys Phe Pro Asn Met Lys He Gin He His Val Ser Ala Ser 
310 315 320 

Thr Pro Pro His Leu Ser Val Gin Pro Thr Gly Leu Thr Phe Tyr Pro 
325 330 335 

Ala Val Asp Val Gin Ala Phe Ala Val Leu Pro Asn Ser Ser Lieu Ala 
340 345 350 

Ser Leu Phe Leu He Gly Met His Thr Thr Gly Ser Met Glu Val Ser 
355 360 365 

Ala Glu Ser Asn Arg Leu Val Gly Glu Leu Lys Leu Asp Arg Leu Leu 
370 375 380 385 

Leu Glu Leu Lys His Ser Asn He Gly Pro Phe Pro Val Glu Leu Leu 
390 395 400 

Gin Asp We Met Asn Tyr He Val Pro He Leu Val Leu Pro Arg Val 
.405 410 415 

Asn Glu Lys Leu Gin Lys Gly Phe Pro Leu Pro Thr Pro Ala Arg Val 
420 425 430 

Gin Leu Tyr Asn Val Val Leu Gin Pro His Gin Asn Phe Leu Leu Phe 
435 440 445 

Gly Ala Asp Val Val Tyr Lys 
450 455 
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What Is Claimed Is: 

1. Use of atomic coordinates of bactericidal/permeability-increasing ("BPI") 
protein, or fragment, analog or variant thereof, to model a BP1 protein. 

2. Use of atomic coordinates of bactericidal/permeability-increasing ("BPI") 
protein, or fragment, analog or variant thereof, to model a BPI-related lipid transfer protein. 

3. The use according to claim 2, wherein the BPI-related lipid transfer protein 
is lipopolysaccharide-binding protein (LBP), cholesteryl ester transferase protein (CETP) or 
phospholipid transfer protein (PLTP), or fragment, analog or variant thereof. 

4. The use according to any of claims 1-3, wherein the BPI protein comprises a 
binding site characterized by amino acid residues of at least one binding pocket as defined 
in Table 31. 

5. The use according to any of claims 1-3, wherein the BPI protein comprises a 
binding site characterized by at least one amino acid sequence, or variant of the sequence, 
selected from positions about 17 to about 45, positions about 36 to about 54, positions about 
65 to about 99, positions about £4 to about 109, positions about 142 to about 164, or 
positions about 142 to about 169 of BPI. 

6. The use according to any of claims 1-3, wherein the BPI protein comprises a 
binding site characterized by amino acid residues of at least one binding pocket as defined 
in Table 3 and a binding site characterized by at least one amino acid sequence, or variant 
of the sequence, selected from positions about 17 to^put 45, positions about 36 to about 
54, positions about 65 to about 99, positions about 84 tb about 109, positions about 142 to 
about 164, or positions about 142 to about 169 of BPI. 
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7. Use of atomic coordinates of bactericidal/permeability-increasing ("BPI") 
protein to computationally design a chemical compound for mimicking BPI protein, or 
fragment, analog or variant thereof. 

8. Use of atomic coordinates of bactericidal/permeability-increasing ("BPI") 
protein to computationally design a chemical compound for mimicking a BPI-related lipid 
transfer protein, or fragment, analog or variant thereof. 

9. The use according to claim 8, wherein the BPI-related lipid transfer protein 
is lipopolysaccharide-binding protein (LBP), cholesteryl ester transferase protein (CETP) or 
phospholipid transfer protein (PLTP). ^ 

10. Use of atomic coordinates of bactericidal/permeability-increasing ("BPI") 
protein, to design a chemical compound capable of associating with a BPI-related lipid 
binding protein, or fragment, analog or variant thereof. 

11. The use according to claim 10, wherein the BPI-related lipid binding protein 
is lipopolysaccharide-binding protein (LBP), cholesteryl ester transferase protein (CETP) or 
phospholipid transfer protein (PLTP), or fragment, analog or variant thereof. 

12. Use of atomic coordinates of bactericidal/permeability-increasing ("BPI") 
protein to design a model of ligands in an active site of a lipid binding protein. 

13. The use according to claim 12, wherein the lipid binding protein is 
bactericidal/permeability-increasing protein (BPI), lipopolysaccharide-binding protein (LBP), 
cholesteryl ester transferase protein (CETP) or phospholipid transfer protein (PLTP), or 
fragment, analog or variant thereof. 

14; Use of atomic coordinates of bactericidal/permeability-increasing ("BPI") 
protein, to design compounds with at least one activity selected from the group consisting of 
antibacterial, antifungal, antimycobacterial, antichlamydial, antiprotozoal heparin-binding, 
endotoxin-binding, heparin-neutralizing, endotoxin-neutralizing, inhibition of tumor and 
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endothelial cell proliferation, inhibition of angiogenesis, anti-inflammatory, anticoagulantand 
antithrombolytic. 

15. The use according to any of claims 1-14, wherein said atomic coordinates 
are according to Table 4. 

16. A method of three-dimensional modeling of a bactericidal/permeability* 
increasing ("BPI") protein comprising the steps of: 

(a) providing three-dimensional atomic coordinates derived from X-ray diffraction 
measurements of a BPI protein in a computer readable format; 

(b) inputting the data from step (a) into a computer with appropriate software 
programs; 

(c) generating a three-dimensional structural representation of the BPI protein 
suitable for visualization and further computational manipulation. 

17. A method of three-dimensional modeling of a bactericidal/permeability- 
increasing ("BPI")-related lipid transfer protein comprising the steps of: 

(a) providing three-dimensional atomic coordinates derived from X-ray diffraction 
measurements of a BPI protein in a computer readable format; 

(b) inputting the data from step (a) into a computer with appropriate software 
programs; 

(c) generating a three-dimensional structural representation of the BPI-related lipid 
transfer protein suitable for visualization and further computational 
manipulation. 

18. The use according to any of claims 16-17, wherein the BPI protein comprises 
a binding site characterized by amino acid residues of at least one binding pocket as defined 
in Table 3. 

19. The use according to any of claims 16-17, wherein the BPI protein comprises 
a binding site characterized by at least one amino acid sequence, or variant of the sequence, 
selected from positions about 17 to about 45, positions about 36 to about 54, positions about 
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65 to about 99, positions about 84 to about 109, positions about 142 to about 164, or 
positions about 142 to about 169 of BPI. 

20. The use according to any of claims 16-17, wherein the BPI protein comprises 
a binding site characterized by amino acid residues of at least one binding pocket as defined 
in Table 3 and a binding site characterized by at least one amino acid sequence, or variant 
of the sequence, selected from positions about 17 to about 45, positions about 36 to about 
54, positions about 65 to about 99, positions about 84 to about 109, positions about 142 to 
about 164, or positions about 142 to about 169 of BPI. 

21 . A method for providing an atomic model of a BPI protein, or fragment, analog 
or variant thereof, comprising 

(a) providing a computer readable medium having stored thereon atomic 
coordinate/x-ray diffraction data of the BPI protein, or fragment, 

t analog or variant thereof, in crystalline form, the data sufficient to 

model the three-dimensional structure of the BPI protein, or fragment, 
analog or variant thereof; 

(b) analyzing, on a computer using at least one subroutine executed in said 
computer, atomic coordinate/x-ray diffraction data from (a) to provide 
atomic coordinate data output defining an atomic model of said BPI 
protein, or fragment, analog or variant thereof, said analyzing utilizing 
at least one computing algorithm selected from the group consisting of 
data processing and reduction, auto-indexing, intensity scaling, 
intensity merging, amplitude conversion, truncation, molecular 
replacement, molecular alignment, molecular refinement, electron 
density map calculation, electron density modification, electron map 
visualization, model building, rigid body refinement, positional 
refinement; and 

(c) obtaining atomic coordinate data defining the three-dimensional 
structure of at least one of said BPI protein, or fragment, analog or 
variant thereof. 
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22. A method according to claim 21, wherein said computer readable medium 
further has stored thereon data corresponding to a nucleic acid sequence or an amino acid 
sequence data comprising at least one structural domain or functional domain of a BPI, LBP, 
CETP or PLTP corresponding to at least one BPI or mutant primary sequence of Figures 2- 
20 or Table 2, or a fragment thereof; and wherein said analyzing step further comprises 
analyzing said sequence data. 

23. A computer-based system for providing atomic model data of the three- 
dimensional structure of BPI protein, or fragment, analog or variant thereof, a BPI mutant 
or a BPI fragment, comprising the following elements: 

(a) at least one computer readable medium (CRM) having stored thereon 
atomic coordinate/x-ray diffraction data of said BPI protein, or 
fragment, analog or variant thereof; 

(b) at least one computing subroutine that, when executed in a computer, 
causes the computer to analyze atomic coordinate/x-ray diffraction data 
from (a) to provide atomic coordinate data output defining an atomic 
model of said BPI protein, or fragment, analog or variant thereof, said 
analyzing utilizing at least one computing subroutine selected from the 
group consisting of data processing and reduction, auto-indexing, 
intensity scaling, intensity merging, amplitude conversion, truncation, 
molecular replacement, molecular alignment, molecular refinement, 
electron density map calculation, electron density modification, 
electron map visualization, model building, rigid body refinement, 
positional refinement; and 

(c) retrieval means for obtaining atomic coordinate output data 
substantially defining the three-dimensional structure of said BPI 
protein, or fragment, analog or variant thereof. 

24. A method for providing a computer atomic model of a ligand of a BPI protein, 
or fragment, analog or variant thereof, comprising 
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(a) providing a computer readable medium (CRM) having stored thereon 
atomic coordinate data of a BP1 protein, or fragment, analog, or 
variant thereof; 

(b) providing a CRM having stored thereon atomic coordinate data 
sufficient to generate atomic models of potential ligands of said BPI 
protein, or fragment, analog, or variant thereof; 

(c) analyzing on a computer, using at least one subroutine executed in said 
computer, the atomic coordinate data from (a) and ligand data from 
(b), to determine binding sites of BPI protein, or fragment, analog, or 
variant thereof, and to provide atomic coordinate data defining an 
atomic model of at least one ligand of said BPI, BPI mutant or a 
fragment thereof, said analyzing utilizing computing subroutines 
selected from the group consisting of data processing and reduction, 
auto-indexing, intensity scaling, intensity merging, amplitude 
conversion, truncation, molecular replacement, molecular alignment, 
molecular refinement, electron density map calculation, electron 
density modification, electron map visualization, model building, rigid 
body refinement, positional refinement; and 

(d) obtaining atomic coordinate model output data defining the three- 
dimensional structure of said at least one ligand of said BPI protein, 
or fragment, analog, or variant thereof. 



25. A computer-based system for providing an atomic model of at least one ligand 
of a BPI, BPI mutant or a fragment thereof, comprising the following elements; 

(a) a computer readable medium (CRM) having stored thereon atomic 
coordinate data of a BPI, mutant or fragment thereof; 

(b) a CRM having stored thereon atomic coordinate data sufficient to 
generate atomic models of potential ligands of a BPI, mutant or 
fragment; * 

(c) at least one computing subroutine for analyzing on a computer, the 
atomic coordinate data from (a) and (b), to determine binding sites of 
BPI protein, or fragment, analog, or variant thereof, and to provide 
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(d) 
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data output defining an atomic model of at least one potential Irgand 
of BP1 protein, or fragment, analog, or variant thereof, said analyzing 
utilizing at least one computing subroutine selected from the group 
consisting of data processing and reduction, auto-indexing, intensity 
scaling, intensity merging, amplitude conversion, truncation, molecular 
replacement, molecular alignment, molecular refinement, electron 
density map calculation, electron density modification, electron map 
visualization, model building, rigid body refinement, positional 

refinement; and 

I 

retrieval means for obtaining atomic coordinate data of said at least 
one ligand of a BPI protein, or fragment, analog or variant thereof. 
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